SCORE Search Results Details for Application 10578781

This page gives you Search Results detail for the Application 10578781 and Search Result 20081 1 C



GenCore version 6.3
Copyright (c) 1993 - 2008 Biocceleration Ltd.



OM nucleic - nucleic search, using sw model

Run on:          November 4, 2008, 17:10:38
                 Search time 1527 Seconds
                 (without alignments)
                 45700.779 Million cell updates/sec

US-10-578-781-1
Perfect score:   756
Sequence:        1 atgggtgccgatatcaaaaa...aggagcaaattgaacattag 756

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

11299264 seqs, 46154159321 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



GenEmbl:*
gb_env:*
gb_pat:*
gb_ph:*
gb_pl:*
gb_pr:*
gb_ro:*
gb_sts:*
gb_sy:*
gb_un:*
gb_vi:*
gb_ov:*
gb_om:*
gb_ba:*
gb_htg1:*
gb_htg2:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.



Result          Query
   No.   Score  Match  Length  DB  ID  Description
ALIGNMENTS



RESULT 1
DD160143
LOCUS       DD160143                 756 bp    DNA     linear   PAT 23-NOV-2005
DEFINITION  Novel Brevibacillus chosinensis and Producing Method of Protein by
            Using Thereof as Host.
ACCESSION   DD160143
VERSION     DD160143.1  GI:83960018
KEYWORDS    WO 2005045005-A/1.
SOURCE      Brevibacillus choshinensis
  ORGANISM  Brevibacillus choshinensis
            Bacteria; Firmicutes; Bacillales; Paenibacillaceae; Brevibacillus.
REFERENCE   1  (bases 1 to 756)
  AUTHORS   Hanagata,H. and Nishijyo,T.
  TITLE     Novel Brevibacillus chosinensis and Producing Method of Protein by
            Using Thereof as Host
  JOURNAL   Patent: WO 2005045005-A 1 19-MAY-2005;
            Higeta Shoyu Co Ltd
COMMENT     OS   Brevibacillus choshinensis
            PN   WO 2005045005-A/1
            PD   19-MAY-2005
            PF   08-NOV-2004 WO 2004JP016912
            PR   11-NOV-2003 JP 03P 381606
            PI   hiroshi hanagata, takayuki nishijyo
            CC
            FH   Key             Location/Qualifiers.
FEATURES             Location/Qualifiers
     source          1..756
                     /organism="Brevibacillus choshinensis"
                     /mol_type="unassigned DNA"
                     /db_xref="taxon:54911"
ORIGIN

Query Match             100.0%;  Score 756;  DB 2;  Length 756;
Best Local Similarity   100.0%;  Pred. No. 6.7e-232;
Matches 756; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

ATGG3JGOCGATATCAAAMTQGGAGJCAAGCATTTCTGACCM 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGGGTGCCGATATCAAAMTQCGACTCMCCAT^ 60 

TTGATAGCCMGAQOCMQCTQXGATACQ^ 1 2 

ATCAGACTQ3TCT03TCCGTCGTCCAGO33TTTA^ 1 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATCAGACTQ3TCTQ3TCCGTCGTCCAG033TTW 1 8 

TTGTTTCAGOTCGGTTQCATTQ3CTTGCTCMG 2 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Qy
Db


661 


Qy 


721 


Db 


721 



301 CGG3^TG^CQ3TACQ3TTMGGTCAGTCGATCGT^ 360 

360 



3.1 HiimiMii^^ 

421 G^AGCAGTQ33MTGA03XGGAGGMGTAGTCTTT 480 

TCCTCCATCCATGAGACCCTTTTTGAAMTGACG^ 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCCTCGATCCATGAG^003TTTTTGAAMTG^CG3CGATCOCATCACACTG^ 540 

ATAGOG^TGAAGGTGTGAACMGTGGTTTGAGAAMTTQO^ 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATAGOG^TGAAGGTGTGAACMGTQGnTTG^GAAM^ 600 

AGGCTGAGCGAGCGTGAGCAGCTCATCGT CTACCTGCGCT ATT ACAAQGATGAGAGAGAG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGGCTGAGCGAGCGTGAGGAGCTCATCGTCTACCTGCGCT 660 

TCTGAGGTAGCAGAGOGTCTAG33ATTTGQGAG3TCCAGGTCTa 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCTGAGGTAGCAGAGCGTCTAGQSATTTCGCAGGTCCAGG^ 720 

721 ATCCTGCTMCGATGAAGGAGCAAATTGAACATTAG 756 
RESULT 2
DI115290
LOCUS       DI115290                 756 bp    DNA     linear   PAT 21-FEB-2008
DEFINITION  Novel Brevibacillus chosinensis and Producing Method of Protein by
            Using Thereof as Host.
ACCESSION   DI115290
VERSION     DI115290.1  GI:168311716
KEYWORDS    KR 1020067008845-A/1.
SOURCE      Brevibacillus choshinensis
  ORGANISM  Brevibacillus choshinensis
            Bacteria; Firmicutes; Bacillales; Paenibacillaceae; Brevibacillus.
REFERENCE   1  (bases 1 to 756)
  AUTHORS   Hanagata,H. and Nishijyo,T.
  TITLE     Novel Brevibacillus chosinensis and Producing Method of Protein by
            Using Thereof as Host
  JOURNAL   Patent: KR 1020067008845-A 1 08-MAY-2006;
COMMENT     PN   KR 1020067008845-A/1
            PD   2006-05-08
            PA   HANAGATA, H., NISHIJYO, T.
            TY   DNA
            OS   Brevibacillus choshinensis
            CC.
FEATURES             Location/Qualifiers
     source          1..756
                     /organism="Brevibacillus choshinensis"
                     /mol_type="unassigned DNA"
                     /db_xref="taxon:54911"
ORIGIN

Query Match             100.0%;  Score 756;  DB 2;  Length 756;
Best Local Similarity   100.0%;  Pred. No. 6.7e-232;
Matches 756; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

ATOGSTGCCGATATCAAAAATQCGAGTCAAQCATTTCTGACCMTG^ 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGGGTQOCG^TATGAAAAATQCGAGTCAAOCATTTCTGACC^^ 60 

61 TTGATAQCGMGAQ0CMQCTQ3CGATAO33^TGCACX^ 120 

ATGAGACTQ3TCTQ3TCCGTCGTCGAGCG0TTTA^ 1 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGAGACTGCTCTGCTCCGTCGTCGAGCGCTTTATGAACCGCG^ 1 80 

TTGTTTGAG^TCGGTTQCATTQGCTTGCTCM^ 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

CGXATGACG3TACQ3TTMGGTCAGT0GATCGTTAA 
CGATCAMGCATGMTTGTACAAGCAATTCGGXGTQCO^ 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 TCCTCGATCCATGA^CCmTTTGAA^^ 540



I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 TCTGAGGTAGCAGAQCGTCTAQC^TTTOC^C^ ; 



RESULT 3
BACSPC1
LOCUS       BACSPQ                  1746 bp    DNA     linear   BCT 16-OCT-1997
DEFINITION  Paenibacillus polymyxa spoIIAA, spoIIAB, and sigma factor genes,
            complete cds.
ACCESSION   L47358
VERSION     L47358.1  GI:2529264
KEYWORDS
SOURCE      Paenibacillus polymyxa
  ORGANISM  Paenibacillus polymyxa
            Bacteria; Firmicutes; Bacillales; Paenibacillaceae; Paenibacillus.
REFERENCE   1  (bases 1 to 1746)
  AUTHORS   Park,S.G. and Yudkin,M.D.
  TITLE     Sequencing and phylogenetic analysis of the spoIIA operon from
            diverse Bacillus and Paenibacillus species
  JOURNAL   Gene 194 (1), 25-33 (1997)
   PUBMED   9266669
COMMENT     GSDB:S:46599.
FEATURES             Location/Qualifiers
     source          1..1746
                     /organism="Paenibacillus polymyxa"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:1406"
                     /clone_lib="DSM36"



     -35_signal      ..11
                     /inference="non-experimental evidence, no additional
                     details recorded"
                     29..33
                     /inference="non-experimental evidence, no additional
                     details recorded"
     gene            70..423
                     /gene="spoIIAA"
     CDS             70..423
                     /gene="spoIIAA"
                     /inference="non-experimental evidence, no additional
                     details recorded"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="AAB81184.1"
                     /db_xref="GI:2529265"
                     /translation="MMLQ ElvEHHRGVL I VRLSGEL DHHTSDIWRIvQvDEAl QRRQCE
                     HI VL SL KNL OF IVDSSGL GVI LGRYKLI NQKG3EIVAVCDVNPPVHRL L DM3GL FKI ^l
                     YDNEVNALTEL EWS"
     gene            420..875
                     /gene="spoIIAB"
     CDS             420..875
                     /gene="spoIIAB"
                     /inference="non-experimental evidence, no additional
                     details recorded"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="AAB81185.1"
                     /db_xref="GI:2529266"
                     /translation="IVREGTGNNFiVBLQFAAKSENEAFARVAVAAFI SRLDPTMDELSD
                     L KTWSEAVTNSI I HGYDSDPSGWTI KVGI EADVI TLWEDAGRQ EDLELAKQPLY
                     TSKPEL ERSGM3FTI IVENFIVDEFEAVSEPGGGTSVRIVKKRI ESKKALYN"
     CDS             887..1642
                     /inference="non-experimental evidence, no additional
                     details recorded"
                     /codon_start=1
                     /transl_table=11
                     /product="sigma factor"
                     /protein_id="AAB81186.1"
                     /db_xref="GI:2529267"
                     /translation="MEAGGKKTSHSYL EDT EVKRL I AL SGSGDNDARETL I NSNI RLV
                     V\6WQRFIVNRGYEPDDL FQ QGI GL L KSVDKFDLSYEVKFSTYAVPM I GEI GRFLRD
                     DGTL KVSRSL KETANKVRKIvKDELSKPL NRL PTVKEVADELGVTPEDWFAGEANKPP
                     TSI HETVFENDGDPI TLIvDQ ADESCERV\FDKLALNEAI DGLSERERLI VYLRYYRDQ
                     TCSEVASRLQ SGVGVSRLEKKI LGQ RDQ AQ"
ORIGIN

Query Match             48.4%;  Score 366.2;  DB 14;  Length 1746;
Best Local Similarity   67.8%;  Pred. No. 5.2e-106;
Matches 512; Conservative 0; Mismatches 243; Indels 0; Gaps 0;
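
The report does not state how the two percentages in each summary block are derived. The figures printed for the results here are consistent with simple ratios, so the following is only a sketch under that assumption: Query Match is taken as the score divided by the perfect score (756 for this query), and Best Local Similarity as matches divided by aligned positions (matches + mismatches + indels). The function names are hypothetical and are not part of GenCore.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Assumed formula: score as a percentage of the perfect score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches: int, mismatches: int, indels: int = 0) -> float:
    """Assumed formula: matching positions as a percentage of aligned positions."""
    aligned = matches + mismatches + indels
    return 100.0 * matches / aligned


# Values printed for RESULT 3 (score 366.2, 512 matches, 243 mismatches).
print(round(query_match_pct(366.2, 756), 1))           # 48.4
print(round(best_local_similarity_pct(512, 243), 1))   # 67.8
```

The same ratios reproduce the 47.8% and 67.9% printed for the score-361.6 hits (505 matches, 239 mismatches), which is why they are offered here as a plausible reading rather than a documented definition.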



Cy 1 ATQ33TQCGGATATCAAAMTQ0GACTCM(X^ 60 

Db 887 ATGGAAGOAGGAGGAAAAA^ 946 

Gy 61 TTG^TAQCGAAG^QGCMQCTQ^CG^TAGGG^TG3ACGTGAG3TTCTCGTGAATAG3AAT 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 947 CTTATCXX;ACTG^GTCMTC^Q3TG^TMTG^TG3TCGTGAG^CAC^^AATCAATAGTAAT 1006 

Cy 121 ATCAGACTGGTCTGGTCCXJn^^ 180 

III I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II III 
Db 1007 ATGOGQCTGGTCTQGTCTGrOirGCAGCGCTTTATG^ 1066 

Gy 181 TTGTTTGAGATCGGTTGCATTGGCTTGCTGA^ 240 

Db 1067 

Gy 241 GATGTGAGATTTTCGA(XTATGCGCTGCCMTGAT 300 

II I I I I II II II I I I I I II I I I I I I I I II II II I I I I I I I I I I I I 
Db 1127 G^AGTGAAGTTCTCTAGATATGGAGTAOCAATGATTATTQGTGAGATTCAACX^TTCCTT 1186 

Gy 301 CGCGATG^CGGn"ACGGnTMGGTCAGTCG^ 360 

II II II I I I I I I I I I I I II II II II II I I I I I I I I I I I I I I I I II 
Db 1187 CGAG^CGATGGn"ACTCTCMGGTTAQCCGTTCXTTGAAGGAMGAGCC 1246 

Gy 361 CGATCAMGGATGMTTCTACMGGAATTOm^ 420 

Db 1247 AAAATG^G3^CGAGCTGTO 1306 

Gy 421 G^AG3AGTG3GMTGACGCCGGAGGAAGT AGTCTTTQCGCMGAQGCAAQCAG^GCGCCT 480 

1307 ^TUmUtmanMlUMim^TW^ 1366 

Gy 481 TCCTCGATCGATGAG^(XGTTTTTGAAMT 540 

Db 1367 

Gy 541 ATAGGGG^GAAGGTGn~GAACMGra3TTTGAGAAM^ 600 

Db 1427 aVtGCCGAC^ 1486 

Gy 601 AGGCTGAGCGAGGGrGAGCAGCTCATCGTCTACCT 660 

,487 ^ATtzMm*^ 1546 
Qy
Db 



661 TCTGAG3TAGCAGAGCGTCTAG33ATTTCQCAG3TCC^^ 720 
II I I I I I III I I I I I II I I II I I I I I I I I I I I II II I I I I I I I 
1547 TCAGAG3TTGCMGTCX^CTTQGCATTTCTCAG3TO~ * ' 



QCA^At^C^TTa^AAAAAAG 1606 



721 ATCCTQjrAACGMCAAG^GCAAATTGAACATTA 755 
1607 ATTTTACAACAGATGCGCGAT^ 1641 



RESULT 4
Z54161
LOCUS       Z54161                  1799 bp    DNA     linear   BCT 18-APR-200
DEFINITION  B.coagulans genes for SpoIIAA, SpoIIAB and sigma factor.
ACCESSION   Z54161
VERSION     Z54161.1  GI:1552541
KEYWORDS    sigma factor; SpoIIAA; SpoIIAB.
SOURCE      Bacillus coagulans
  ORGANISM  Bacillus coagulans
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
REFERENCE   1  (bases 1 to 1799)
  AUTHORS   Park,S.G. and Yudkin,M.D.
  TITLE     Nucleotide sequence of the Bacillus coagulans homologue of the
            spoIIA operon of Bacillus subtilis
  JOURNAL   Gene 177 (1-2), 275-276 (1996)
   PUBMED   8921882
REFERENCE   2  (bases 1 to 1799)
  AUTHORS   Park,S.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-SEP-1995) Sung Goo SGPark, Microbiology Unit,
            Biochemistry, University of Oxford, South Parks Road, Oxford, OX1
            3QU, UK
FEATURES             Location/Qualifiers
     source          1..1799
                     /organism="Bacillus coagulans"
                     /mol_type="genomic DNA"
                     /isolate="DSM1"
                     /db_xref="taxon:1398"
     -35_signal      75..80
     -10_signal      98..101
     CDS             137..487
                     /note="gtg start codon"
                     /codon_start=1
                     /transl_table=11
                     /product="SpoIIAA"
                     /protein_id="CAA90872.1"
                     /db_xref="GI:1552542"
                     /db_xref="GOA:P70877"
                     /db_xref="InterPro:IPR002645"
                     /db_xref="InterPro:IPR003658"
                     /db_xref="UniProtKB/Swiss-Prot:P70877"
                     /translation="M3LAI DLEVKKDVLCI RLQGEL DHHTAESL RASVTKAI EENG R
                     HL VL NL EQL SF MDSSGL GVI LGRYKQ KQKNGEM VCAI SP AVKRL F EL SGL F Kl I RL
                     DESEQYAL HRL GVA"
     CDS             488..931
                     /codon_start=1
                     /transl_table=11
                     /product="SpoIIAB"
                     /protein_id="CAA90873.1"
                     /db_xref="GI:1552543"
                     /db_xref="GOA:P70878"
                     /db_xref="InterPro:IPR003594"
                     /db_xref="InterPro:IPR010194"
                     /db_xref="UniProtKB/Swiss-Prot:P70878"
                     /translation="IvKNKMXQFTARSRNESFARVRVAAFVAQL DPTL DELTGNQTW
                     SEAVTNAI I HGYEENPDGWYVSAAI CDDGTVEI TVRDEGKG ADI EEARQPSFTTKP
                     ELERAGVGFTI IVENFfvDDVEl G3QQESGTI I RLKKHLALKNALCN"
     CDS             944..1702
                     /codon_start=1
                     /transl_table=11
                     /product="sigma factor"
                     /protein_id="CAA90874.1"
                     /db_xref="GI:1552544"
                     /db_xref="GOA:P70879"
                     /db_xref="HSSP:P00579"
                     /db_xref="InterPro:IPR000943"
                     /db_xref="InterPro:IPR007624"
                     /db_xref="InterPro:IPR007627"
                     /db_xref="InterPro:IPR007630"
                     /db_xref="InterPro:IPR013324"
                     /db_xref="InterPro:IPR013325"
                     /db_xref="UniProtKB/TrEMBL:P70879"
                     /translation="l\/DAEL KKAENKARL KDEEVKNL I RRTEEGDQEAPDL I VEKNI RL
                     VWBWHRFLNRGYEPDDLFQ G3I GL L KSVDKF DLSYDVKFSTYAVPM I GEI QRFLR
                     DDGTVKVSRSL KETGNRI RKI RDELTKKLGRSPTVHEI ASELQVPVEEWM<\QEAGKA
                     PSSI HETVYENDGDPI TLLDQ SDCNDCKV\FDG: ALQEAI RGLEERERLI VYLRYYKD
                     QTG3EAGERLGI SQVGVSRL EKKI LRVIVKERVEL"



Query Match             Score 363.2;  DB 14;
Best Local Similarity   Pred. No. 4.9e-105;
Matches 494; Conservative 0; Mismatches 218;



Qy
Qy
Qy
Db
Qy
Db 1170
Qy 284
Db 1230
Qy 344
Db 1290
Qy 404
Db 1350
Qy 464
Db 1410
Qy 524
Db 1470
Qy 584
Db 1530
Qy 644
Db 1590
Qy 704



1 1 1 1 1 1 1 1 I 1 1 1 1 



I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



104 TTCT03TGMTAGGMTATGAGACTG3TCTG3TCCGTCGTCGAG3GGTTTATCAACCG^ 163 
U^cUA^Vk^W^^Tmcl^^AkklM ,109 



i ii mi ii him i nun i n n iiiiiiniii 

MTTT«TCTTTCGTAT«TCTC»MTTTTCCACCTATa»GrCCCCATGI\TCATCG333 1 229 
284 AMTTCMCajrTTTTQ3aX3AT«CG3rACQ3n"AAQ3rCAGrCGATCGrTAAAAQAAA 343 



I I 1 1 1 I 1 1 1 1 1 I I 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 I 

OCX3TGCATGAAATTGCATCAGAACT0333ArGC033^ 1409 



. I I I I I I I III 



I I I I I I I I I I I I 



.II II I I I I I I I I I I I I 



jTCCGAAGCTGGCGAAG 



I II II llllll II II I I III II 1 1 1 1 II III 
1650 GACQ3CTTG^AMG^AAATACTTCG^Gn"^TGAAA»\QCX3GATQ3AATTCTA 1701



RESULT 5
AJ704768
LOCUS       AJ704768                3701 bp    DNA     linear   BCT 10-MAY-2005
DEFINITION  Bacillus amyloliquefaciens dacF gene, spoIIA operon and spoVAA gene
            (partial), strain FZB42.
ACCESSION   AJ704768
VERSION     AJ704768.1  GI:63146678
KEYWORDS    anti-sigma F factor antagonist; anti-sigmaF factor; dacF gene;
            penicillin binding protein; sigF gene; Sigma F; spoIIAA gene;
            spoIIAB gene; spoVAA gene; Stage V sporulation protein AA.
SOURCE      Bacillus amyloliquefaciens FZB42
  ORGANISM  Bacillus amyloliquefaciens FZB42
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
REFERENCE   1
  AUTHORS   Stragier,P.
  TITLE     Comparison of early sporulation genes in Bacilli and Clostridiae
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3701)
  AUTHORS   Borriss,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-MAY-2004) Borriss R, Biology, Humboldt University,
            Chaussee Strasse 117, D-10115, GERMANY
COMMENT     spoIIA operon 1290 - 3083.
FEATURES             Location/Qualifiers
     source          1..3701
                     /organism="Bacillus amyloliquefaciens FZB42"
                     /mol_type="genomic DNA"
                     /strain="FZB42"
                     /isolation_source="soil"
                     /db_xref="taxon:326423"
                     /country="Germany:Berlin-Brandenburg"
     gene            123..1292
                     /gene="dacF"
     CDS             123..1292
                     /gene="dacF"
                     /EC_number="3.4.16.4"
                     /function="required for spore cortex synthesis"
                     /note="putative D-Ala-D-Ala carboxypeptidase"
                     /codon_start=1
                     /transl_table=11
                     /product="penicillin binding protein"
                     /protein_id="CAG28930.1"
                     /db_xref="GI:63146679"
                     /db_xref="GOA:Q50J84"
                     /db_xref="HSSP:P39042"
                     /db_xref="InterPro:IPR001967"
                     /db_xref="InterPro:IPR012338"
                     /db_xref="InterPro:IPR012907"
                     /db_xref="UniProtKB/TrEMBL:Q50J84"
                     /translation="IVKRLI STL LI Gl L LTASAPSAFAKPDGKHTSEL AHEAKSAVL I E
                     RDTGSI LYNKNSRERLAPASMTKI MTM_LI IVEALDKGKI KIVBDKVRTSEHAASIVGGSQ
                     I FLEPGEEMTVKEIVLKQ Al ASGNDASVAIVAEYI AGSEEDFVSRMMKKAKELGLKDTS
                     F KNPTGL PEKDHYSSAYDIVAKIvAKEL L KYDKI TQFTGTYEDYL RENTDKKFW.VNTNR
                     LI KF YPGVDGVKTGFTGEAKYCL TATAKKGNIVRVI AWFGASTPKERNAQVTKIVL DYA
                     FSQFKTHPLYKRDQ VGTVKVKKGKGKL I KLTTSEPI SL L AKKGENMDKVKKEVKI NG
                     NVTAPVKKGEVLGSLVL KKDGKVLVESPVTAKDDIVEKAGFLTFL KRTM3DWTKFK"
     operon          1290..3083
                     /operon="spoIIA"
                     /function="expression of early forespore genes"
     gene            1388..1741
                     /gene="spoIIAA"
                     /operon="spoIIA"
     CDS             1388..1741
                     /gene="spoIIAA"
                     /operon="spoIIA"
                     /function="binding to SpoIIAB in the presence of ADP
                     selectively in the forespore"
                     /codon_start=1
                     /transl_table=11
                     /product="anti-sigma F factor antagonist"
                     /protein_id="CAG28931.1"
                     /db_xref="GI:63146680"
                     /db_xref="HSSP:P10727"
                     /db_xref="InterPro:IPR002645"
                     /db_xref="InterPro:IPR003658"
                     /db_xref="UniProtKB/TrEMBL:Q50J83"
                     /translation="IVBLQ DFHVKESVLCI RLTGEL DHHTAESL KCKVTESL EADQ R
                     HI VLNLEDLTFMDSSGLGVI LGRYKQ KQLGGEIvWCAl SP AVRRL F DM3GL F Kl I RF
                     EGSEQKALLTLGVAS"
     gene            1738..2178
                     /gene="spoIIAB"
                     /operon="spoIIA"
     CDS             1738..2178
                     /gene="spoIIAB"
                     /operon="spoIIA"
                     /EC_number="2.7.1.37"
                     /function="binding to sigmaF in the presence of ATP before
                     septation"
                     /codon_start=1
                     /transl_table=11
                     /product="anti-sigmaF factor"
                     /protein_id="CAG28932.1"
                     /db_xref="GI:63146681"
                     /db_xref="GOA:Q50J82"
                     /db_xref="HSSP:CB2727"
                     /db_xref="InterPro:IPR003594"
                     /db_xref="InterPro:IPR010194"
                     /db_xref="UniProtKB/TrEMBL:Q50J82"
                     /translation="IvKNEMHLEFSALSQMESFARVTVASFI AQLDPTIVDELTEI KTVV
                     SEAVTNAI I HGYEENCDGKVYI SVTLEDHVVYLTI RDEQ Gl TDLEEARQPLFTTKPE
                     LERSGM3FTI MENFMDDVTI DSSPEM3TTI RLTKHLSKSKALCN"
     gene            2190..2957
                     /gene="sigF"
                     /operon="spoIIA"
     CDS             2190..2957
                     /gene="sigF"
                     /operon="spoIIA"
                     /function="sporulation specific sigma factor"
                     /codon_start=1
                     /transl_table=11
                     /product="Sigma F"
                     /protein_id="CAG28933.1"
                     /db_xref="GI:63146682"
                     /db_xref="GOA:Q50J81"
                     /db_xref="HSSP:P00579"
                     /db_xref="InterPro:IPR000943"
                     /db_xref="InterPro:IPR007624"
                     /db_xref="InterPro:IPR007627"
                     /db_xref="InterPro:IPR007630"
                     /db_xref="InterPro:IPR013325"
                     /db_xref="UniProtKB/TrEMBL:Q50J81"
                     /translation="MDVEVKKNSKNAQL KDHEVKEL I KKSGEGDGOARDLLI EKNIVRL
                     VWSWQRFLNRGYEPDDLFQ GCI GL L KSVDKF DLSYDVRFSTYAVPM I GEI QRFI R
                     DDGT VKVSRSL KEL GNKI RRAKDELSKTYGRVPTVGEI ADHLEI DAEDWLACEAVRA
                     PSSI HETVYENDGDPI TLLDQ ADHSEEKV\FDKI ALKEAI SDLEEREKLI VYLRYYKD
                     QTGSEVAERLGI SQVGVSRLEKKI LKQ KVQVDHTES"
     gene            3084..>3701
                     /gene="spoVAA"
     CDS             3084..>3701
                     /gene="spoVAA"
                     /codon_start=1
                     /transl_table=11
                     /product="Stage V sporulation protein AA"
                     /protein_id="CAG28934.1"
                     /db_xref="GI:63146683"
                     /db_xref="UniProtKB/TrEMBL:Q50J80"
                     /translation="MERRI FLRLRHRVKAHTGDI I Tl GDAAQ AQQQKLKKKLSALPL
                     YKVTKKDKNI WLDI I KVLKTI HTAF PAL DVQTVQ3AETI VEI HYQKKQL SAVL F VGV
                     W.LLFI GSCLAI MMFHEDVSIVREVHI ALYEI I TGERNEYPYLLQ PYSI GLGLGM VF
                     FNHI FKKRL NEEPSPL EVEIVFNYQL DL DHYVAMHENEETIVKDL HDR"
ORIGIN

Query Match             47.8%;  Score 361.6;  DB 14;  Length 3701;
Best Local Similarity   67.9%;  Pred. No. 1.8e-104;
Matches 505; Conservative 0; Mismatches 239; Indels 0; Gaps 0;

GATATCAAAMTGCGAGTGMCGATTTCTGAC^^ 69 

I II Mill III II I I I I I I I Mill II II 

GTTMG'\AAMCAGGAAAMCG03CAGCTCAAGGACCATGMGrrCAMGAATTMTCAAA 2261 

MGAGOCAAGGTGQOGATAGGGATQGACGTCAGS^ 1 29 

I I I I I I I I I I I I I I I 
AAGAGCGAAGAGGGGGAC 

GTCTGGTCCGTCGTOCAC. ... 

I I I I I I I I I I I I I I I III I I I I I I I I II I I II I I I II I I I I I I I I I 
GTGTGGTCGGTGGTTCAMQ3TTCTTAMCAGAGXTATGAG3CQG^TG^TCTGTTTCAG 2381 

ATCGGTTGGATTGGCTTGCTCMGGCCGTTG^ 249 
I I I I I II II II III I II II li I II II I II I II I II II II I II I 
ATCQ3CTGn"ATCGGOTTGnTAAMTCCGTTG^TAAAT 2441 

250 TTTTOOKCTATGOQCTG^^ 309 

uuuinmcimMwum^ 

GGTACGGTTMGGTCAGrCGATGGTTAAM 369 

UjJJUWJJ^^ 256, 

GATGAATTGTA(^GCMTTOmjGn-G^ 429 

G^GAACTGTCAAAM^ 2621 

GGMTCACX3C033AQ3MGTAG^ 489 

CATGAG^CCGTTTTTGAAMTG^CGGCGATCCCATCACACTG^TCG^TCAGATAGO^ 549 

I I I I I I II II I I II II II I II I II II II II II I III I II I II I II II III 
CATG^GACXXjTGTATGAAMTG^CGG^GACCCG^ 2741 

GAAQGTGTGAAGMGTQGnTTG^GAAMTTGXT^ 609 
I I I I II I II I II I I I I I I I I I I I I II I I I I I I I I I II 
2742 CATTCAG^AG^G^AGTGGnTTG^CAAAATCGGXT 2801 

610 G^QGGn"G^QGAQ3TCATCGTCTACCTGOXTATTACAAGGATCAGACACAGn"CTG^ 669 

II II II I I II I I II I II III I I II I II I II I II II I I II I I II III 
GAACGG3AAAMCTCATTGTCTATCTGAG^TATTATAMGATCAGACACAGn"CGGAAGT A 2861 

G3AGAGGCTCTAGG3^TTTCGCAGCTCCAGGTCTCGCG^ 729 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I 
G3CGAACGGCTCGQ3^TTTCCCMGT(XAQCT^ 2921 

ACGATGMGGAGCAAATTGAACAT 753 

I I I I I I I I I I I I I I I I 



Qy
Db 2202
Qy
Qy
Qy 190
Db 2382
Qy
Db 2442
Qy
Db 2502
Qy
Db 2562
Qy
Qy 490
Db 2682
Qy 550
Db 2742
Qy 610
Db 2802
Qy 670
Db 2862
Qy 730
Db 2922



RESULT 6 
CP000560_22/c
COMMENT

Sequence split into 40 fragments LOCUS CP000560 Accession CP000560
Fragment Name     Begin       End
CP000560_00           1    110000
CP000560_01      100001    210000
CP000560_02      200001    310000
CP000560_03      300001    410000
CP000560_04      400001    510000
CP000560_05      500001    610000
CP000560_06      600001    710000
CP000560_07      700001    810000
CP000560_08      800001    910000
CP000560_09      900001   1010000
CP000560_10     1000001   1110000
CP000560_11     1100001   1210000
CP000560_12     1200001   1310000
CP000560_13     1300001   1410000
CP000560_14     1400001   1510000
CP000560_15     1500001   1610000
CP000560_16     1600001   1710000
CP000560_17     1700001   1810000
CP000560_18     1800001   1910000
CP000560_19     1900001   2010000
CP000560_20     2000001   2110000
CP000560_21     2100001   2210000
CP000560_22     2200001   2310000
CP000560_23     2300001   2410000
CP000560_24     2400001   2510000
CP000560_25     2500001   2610000
CP000560_26     2600001   2710000
CP000560_27     2700001   2810000
CP000560_28     2800001   2910000
CP000560_29     2900001   3010000
CP000560_30     3000001   3110000
CP000560_31     3100001   3210000
CP000560_32     3200001   3310000
CP000560_33     3300001   3410000
CP000560_34     3400001   3510000
CP000560_35     3500001   3610000
CP000560_36     3600001   3710000
CP000560_37     3700001   3810000
CP000560_38     3800001   3910000
CP000560_39     3900001   3918589



Continuation (23 of 40) of CP000560 from base 2200001 (CP000560 Bacillus amyloliquefac



Query Match             47.8%;  Score 361.6;  DB 14;  Length 110000;
Best Local Similarity   67.9%;  Pred. No. 3.1e-104;
Matches 505; Conservative 0; Mismatches 239; Indels 0; Gaps 0;



Qy 10
Db 47250
Qy 70
Db 47190
Qy 130
Db 47130
Qy 190
Db 47070
Qy 250
Db 47010
Qy 310
Db 46950



GTTAAGAAAMGAQGAA^ ' 



llllllllll Mill I I III I II II III I II II II I II 



ATCG3TTGGATTGQCTTGCTCMQGCXXXrGA^ 249 

I I I I I II II II III I II I I I I I I I II I I I I I I I II II I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



310 G3TACXX^TMG3TCAGTCGATCGnTA^ 

G3AA(XGTcl^GTTTCM 46891 
Qy
Db 



cy 

Db 



370 G^GAATTCTAGAAG3MTTCG3CXX^GC^^ 429 
46890 GATGAACTCTCAAA^ 46831 
430 G3MTCACX3GCG3AG3AACTACT^ 489 



490 GAT^^CrarTTTTGAAMTG^033^^ 549 

46770 ^m^my.mctu^u^Vcyxiiv w „ 

550 GAAGGn~CTGAACMGTQGnTT^GAAMTTQOT^ 609 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
46710 GATTCAGAA^GAACTQGfTTT(^CAAAATCQCCCTGAAQ3AQ3CGATCACT^TTTQGAA 46651 

610 ^QGGn"^QGAQCTGATCGrCTACCTGGGCTATTAGAAQ3ATCAGAGAGAGn"CT^QGTA 669 
II II II I I I I I I I I I I I III I I I I I I II I I I I I I I I I I I I I I II III 
46650 GAACQQGAAAAACTGATTGrCTATCTGA^TATTATAAAGATCAGAGAGAGTCQG^GrA 46591 

670 GGAGAGGGn~CTAGQ^TTTCGCAGGrCCAQCT 729 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
46590 GOCGAACGQCTCGQ^TTTCCCMCT 46531 

730 ACGATCMQGAQCAAATTGAACAT 753 
III II I III II II III 
46530 GAGMTAAAGTQCAGMQGMGAT 46507 



RESULT 7
CP000813_21/c
COMMENT

Sequence split into 37 fragments LOCUS CP000813 Accession CP000813
Fragment Name     Begin       End
CP000813_00           1    110000
CP000813_01      100001    210000
CP000813_02      200001    310000
CP000813_03      300001    410000
CP000813_04      400001    510000
CP000813_05      500001    610000
CP000813_06      600001    710000
CP000813_07      700001    810000
CP000813_08      800001    910000
CP000813_09      900001   1010000
CP000813_10     1000001   1110000
CP000813_11     1100001   1210000
CP000813_12     1200001   1310000
CP000813_13     1300001   1410000
CP000813_14     1400001   1510000
CP000813_15     1500001   1610000
CP000813_16     1600001   1710000
CP000813_17     1700001   1810000
CP000813_18     1800001   1910000
CP000813_19     1900001   2010000
CP000813_20     2000001   2110000
CP000813_21     2100001   2210000
CP000813_22     2200001   2310000
CP000813_23     2300001   2410000
CP000813_24     2400001   2510000
CP000813_25     2500001   2610000
CP000813_26     2600001   2710000
CP000813_27     2700001   2810000
CP000813_28     2800001   2910000
CP000813_29     2900001   3010000
CP000813_30     3000001   3110000
CP000813_31     3100001   3210000
CP000813_32     3200001   3310000
CP000813_33     3300001   3410000
CP000813_34     3400001   3510000
CP000813_35     3500001   3610000
CP000813_36     3600001   3704465

Continuation (22 of 37) of CP000813 from base 2100001 (CP000813 Bacillus pumilus SAFR-

Query Match             46.9%;  Score 354.6;  DB 14;  Length 110000;
Best Local Similarity   67.6%;  Pred. No. 5.6e-102;
Matches 498; Conservative 0; Mismatches 239; Indels 0; Gaps 0;

AAAMT(XX^GTCM(XATTTCT 75 
MQCAAGCAAAAAAAQCCCAQCT 1 4442 

GAAGCTG3CG^ACQ3ATQGACCT 1 35 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
CAAGAOGGTGATCAQCMQGMGACW^ 1 4382 

TCmrCGTCCAQ3Q3TTTATCMCCQCQGGTATGAAQCQ3ATGATTTCT^ 1 95 

I I I I I II I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I II II 
TCCGJTGJTCAQCQCTTTTTAAATAGAQ3ATATGAACXIAGATGATTTGJTTCAAATTQ3C 1 4322 

TQCATTQ3CTTQCTCMQQGCGJTGACMGJTCGATCTTTCGJACX^TGJGAGATTTTCG 255 

II I I I I I I I I I I II I I I I I I I II II III I II II I I I I I I I I I I I I I 



III II MINIM II II Mill II II II I II Mill II II 

TQCTGJTCCMTGATTATTQ3TGAMTCCAQ3GATTCATTCQ3GATGATQ3AACA 1 4202 

316 GTTMQ3TCAGn~CGATCGnTAAMGAM 375 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II III I I I I I I 
14201 GTCAAAGTCAQCCQ3TCACTAMGGAACTCX33AMTAAMTCCQ3CQ3GCAAG^ 14142 



436 ACQOCQGAQGAAGTAGTCTTTQCGCMGA^ 495 

ymiiuuiuu^ U02: 



Qy
Qy 76
Qy
Qy
Qy
Qy
Qy
Qy 436
Qy
Db 14021
Qy 556
Db 13961
Qy 616
Db 13901
Qy 676
Db 13841
Qy 736
Db 13781



^GGAGCTGATCGTCTACCTGCQCW 675 

13842 



13781 CAAATQGAAATQ3ATCA 13765 



RESULT 8 
CP000557_23/c
COMMENT

Sequence split into 36 fragments LOCUS CP000557 Accession CP000557
Fragment Name     Begin       End
CP000557_00           1    110000
CP000557_01      100001    210000
CP000557_02      200001    310000
CP000557_03      300001    410000
CP000557_04      400001    510000
CP000557_05      500001    610000
CP000557_06      600001    710000
CP000557_07      700001    810000
CP000557_08      800001    910000
CP000557_09      900001   1010000
CP000557_10     1000001   1110000
CP000557_11     1100001   1210000
CP000557_12     1200001   1310000
CP000557_13     1300001   1410000
CP000557_14     1400001   1510000
CP000557_15     1500001   1610000
CP000557_16     1600001   1710000
CP000557_17     1700001   1810000
CP000557_18     1800001   1910000
CP000557_19     1900001   2010000
CP000557_20     2000001   2110000
CP000557_21     2100001   2210000
CP000557_22     2200001   2310000
CP000557_23     2300001   2410000
CP000557_24     2400001   2510000
CP000557_25     2500001   2610000
CP000557_26     2600001   2710000
CP000557_27     2700001   2810000
CP000557_28     2800001   2910000
CP000557_29     2900001   3010000
CP000557_30     3000001   3110000
CP000557_31     3100001   3210000
CP000557_32     3200001   3310000
CP000557_33     3300001   3410000
CP000557_34     3400001   3510000
CP000557_35     3500001   3550319



Continuation (24 of 36) of CP000557 from base 2300001 (CP000557 Geobacillus thermodeni



Query Match              Score 353.6; DB 14; Length 110000;
Best Local Similarity    Pred. No. 1.2e-101;
Matches 485; Conserv. 0; Mismatches 219; Indels 0; Ga

[Alignment row labels and start coordinates: Qy 53 / Db 58581; Qy 113 / Db 58521; Qy 173 / Db 58461; Qy 233 / Db 58401; Qy 293 / Db 58341; Qy 353 / Db 58281; Qy 413 / Db 58221; Qy 473]



. I I I I I I I I I ! I I I I I I I I I 



^V^C^^ 58402 



I Mill II II I MINI II II Mill MMIMIMI II Mill 



I I I I I I I I I I I I I I I 

TCTCCG33^TG^CGQGACQGn"GAMGTCAGC^^ 58282 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Qy 473 ^GGGCCTTCCTCGATCCATGAGACCGTT^ 532
Db 58161 GCTOG0OGGCCT<^T^

Qy 533 TCX^TCAGATAGCG^TGMGGTCT 592

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II

Db 58101 TCX^TGAMTCXXXX^TGCTGA^

Qy 593 <XATCAGCAGGCTa\^^ 652

III III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

Db 58041 C!GATTGMGAGCTCGATGAGCQCGAAAG3C^^ 57982

Qy 653 AGAGAGACTCT^a^AQGAGAaXHTCTAQQ3ATTTa^Q3rCCAQG^CTCQ3CTCTQG 712

Db 57981 AAACCCAGTCCGAACT 57922

Qy 713 AAAAQCCTATCCTQGTAA0GMGAAQ3AQGAAATTGAAGATTAG 756

Db 57921 AAAAGAAAATATTokoCAGATAM 57878



RESULT 9 

AP006627_19 

COMMENT

Sequence split into 43 fragments LOCUS AP006627 Accession AP006627 



Fragment Name     Begin       End
AP006627_00           1    110000
AP006627_01      100001    210000
AP006627_02      200001    310000
AP006627_03      300001    410000
AP006627_04      400001    510000
AP006627_05      500001    610000
AP006627_06      600001    710000
AP006627_07      700001    810000
AP006627_08      800001    910000
AP006627_09      900001   1010000
AP006627_10     1000001   1110000
AP006627_11     1100001   1210000
AP006627_12     1200001   1310000
AP006627_13     1300001   1410000
AP006627_14     1400001   1510000
AP006627_15     1500001   1610000
AP006627_16     1600001   1710000
AP006627_17     1700001   1810000
AP006627_18     1800001   1910000
AP006627_19     1900001   2010000
AP006627_20     2000001   2110000
AP006627_21     2100001   2210000
AP006627_22     2200001   2310000
AP006627_23     2300001   2410000
AP006627_24     2400001   2510000
AP006627_25     2500001   2610000
AP006627_26     2600001   2710000
AP006627_27     2700001   2810000
AP006627_28     2800001   2910000
AP006627_29     2900001   3010000
AP006627_30     3000001   3110000
AP006627_31     3100001   3210000
AP006627_32     3200001   3310000
AP006627_33     3300001   3410000
AP006627_34     3400001   3510000
AP006627_35     3500001   3610000
AP006627_36     3600001   3710000
AP006627_37     3700001   3810000
AP006627_38     3800001   3910000
AP006627_39     3900001   4010000
AP006627_40     4000001   4110000
AP006627_41     4100001   4210000
AP006627_42     4200001   4303871



Continuation (20 of 43) of AP006627 from base 1900001 (AP006627 Bacillus clausii KSM-

Query Match 46.1%; Score 348.4; DB 14; Length 110000;
Best Local Similarity 68.2%; Pred. No. 5.6e-100;
Matches 484; Conservative 0; Mismatches 226;



[Alignment row labels and start coordinates: Qy 37 / Db 12188; Qy 97; Qy 157 / Db 12308; Db 12428; Qy 337; Qy 397; Qy 517 / Db 12668; Qy 577 / Db 12728; Qy 637 / Db 12788; Qy 697 / Db 12848]



37 CTGA(X^TGACCMCTGAMGATTTG^ 96 



j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 



Mill I II I MM II II II 



MM M M I I I I I I I I I I III 



517 GATCCX^TGACACTGATCGATC^GATAQGGGATGMQGTGTGAACAAGTGGTTTGAGAAA 576 



I I I I I I I M M III I I I I I I I I 



697 GAQ3TCTCGCGTCTQ3AAAAGCX^AT 746 
CAAGTTTCTCG3CTAGAGAAAAAAA 12897 



RESULT 10 
BACSPC2A 
LOCUS       BACSPC2A                 864 bp    DNA     linear   BCT 26-APR-1993
DEFINITION  B.subtilis spoIIA gene encoding a sporulation-specific sigma
            factor, complete cds.
ACCESSION   M15744
VERSION     M15744.1  GI:143610
KEYWORDS    sporulation-specific sigma factor.
SOURCE      Bacillus subtilis
  ORGANISM  Bacillus subtilis
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
REFERENCE   1  (bases 1 to 864)
  AUTHORS   Yudkin,M.D.
  TITLE     Structure and function in a Bacillus subtilis sporulation-specific
            sigma factor: molecular nature of mutations in spoIIAC
  JOURNAL   J. Gen. Microbiol. 133 (Pt 3), 475-481 (1987)
   PUBMED   3116160
COMMENT     Original source text: B.subtilis (strain 168) DNA, clone
            BglII/PstI.
            Draft entry and clean copy sequence for [1] kindly provided by
            M.D. Yukin, 26-MW-1987.
FEATURES             Location/Qualifiers
     source          1..864
                     /organism="Bacillus subtilis"
                     /mol_type="genomic DNA"
                     /strain="W168"
                     /sub_strain="PY79"
                     /db_xref="taxon:1423"
     CDS             90..857
                     /note="sporulation-specific sigma factor"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="AAA22788.1"
                     /db_xref="GI:143611"

/ 1 ransl at i on=" MDVEVKKNGKNAQL KDHEVKEL I KC6QNGDQQARDL L I EKNIVRL 
VW5WQRFL NRGYEPDDL FQ QGI GL L KSVDKF DLTYDVRFSTYAVPM I GEI QRFI R 
DDGTVKVSRSL KELGNKI RRAKDELSKTLGRVPTVQEI ADHLEI EAEDWLAQEAVRA 
PSSI HETVYENDGDPI TLLDQ ADNSEEKV\FDKI ALKEAI SDLEEREKLI VYLRYYKD 
QTOSEVAERLGI SQVOVSRL EKKI LKQ KVQVDHTDG' 
ORIGIN      2 bp upstream of ClaI site.

Query Match 45.7%; Score 345.6; DB 14; Length 864;
Best Local Similarity 66.5%; Pred. No. 2.1e-99;
Matches 495; Conservative 0; Mismatches 249; Indels 0; Gaps 0;
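
The two percentages in this summary appear to follow directly from the other numbers on the same lines. A minimal sketch, assuming that Query Match is the hit score divided by the perfect score (756 in the report header) and that Best Local Similarity is the number of matches divided by the number of aligned columns; both formulas and both function names are assumptions inferred from the reported values, not taken from the search program's documentation:

```python
def query_match_percent(score, perfect_score):
    """Score of the hit as a percentage of the perfect (self) score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_percent(matches, conservative, mismatches, indels):
    """Matches as a percentage of all aligned columns (assumed definition)."""
    aligned_columns = matches + conservative + mismatches + indels
    return round(100.0 * matches / aligned_columns, 1)


# Values from the RESULT 10 summary; 756 is the perfect score in the header.
print(query_match_percent(345.6, 756))                # 45.7
print(best_local_similarity_percent(495, 0, 249, 0))  # 66.5
```

The same arithmetic also reproduces the 45.4% and 67.3% reported for the hit with score 343, 484 matches and 235 mismatches, which supports the assumed definitions without proving them.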

CAAAMTGOGAGrCMGCATTTCTGACC^ 69 

I I I I I I I I I I I I I III I I I I I I I I I I I I 



[Alignment row labels and start coordinates: Qy 10 / Db 102; Qy 70 / Db 162; Qy 130 / Db 222; Qy 190 / Db 282; Qy 250 / Db 342; Qy 310 / Db 402; Qy 370 / Db 462; Qy 430 / Db 522; Qy 490 / Db 582; Qy 550 / Db 642]



CCAG 281 



250 TTTT(DGA(Xn7\TGCQGn~GCCAATGATCAT 309 

Q3TA(DGC^TMC<^GAGTCG^TCGTTAAAAC 369 

II II II Mill II II MINI I II II I Mill I III 

GGAAJXXarAMQCTATCAOGCT^ 461 

GATGAATTGTA^ODMTTCam^aa^^ 429 

I I I I I II II I I II I I I I I I I I I II I I I I I II 



Db 642 AACTCAGAAGAAAAA"

Qy 610 ^GGGn~^GGAG3TGATCCTCTACCT^ 669

II III I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I

Db 702 GAMGGGAAAMCTMTCGTCTATCTCAGATATTAT^ 761

Qy 670 GSAGAGGCTCTAGQGATTTOQCAQCT^ 729

Qy 730 ACG^GMGG<\GCAAATTGAACAT 753
Db 822 GAciTC^a^TCAAATQ^TCAT 845



RESULT 11 
Z99116/c
LOCUS

DEFINITION

ACCESSION
VERSION
KEYWORDS
SOURCE
ORGANISM

REFERENCE
AUTHORS

TITLE

JOURNAL
PUBMED
REFERENCE
AUTHORS
TITLE
JOURNAL



Z99116 204537 bp DNA linear BCT 18-APR-2005

Bacillus subtilis complete genome (section 13 of 21): from 2409151
to 2613687.
Z99116 AL009126
Z99116.2 GI:32468787

Bacillus subtilis subsp. subtilis str. 168

Bacillus subtilis subsp. subtilis str. 168

Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.

1 (bases 1 to 204537)

Kunst , F. , Cgasawar a, N. , Ivbszer,!., Al ber t i ni , A. M , Al I oni , G. , 
Azevedo, V. , Bertero, MG. , Bessi er es, P. , Bolotin.A. , Borchert.S. , 



nel I , S. C. , 
Capuano, V. , 
F. , Currrri ngs, N. J. , 
A., Ehrl i ch, S. D. , 
, C. , Fer r ar i , E. , 



ss, R. , Boursi er , L. , Brans, < 
Bron,S. , Br oui I I et , S. , Br uschi , C. V. , Cal dwel I , B 
Carter, N. M , Choi , S. K. , Codani.J.J., Connerton, 
Dani el , R. A. , Deni zot , F. , Devi ne, K. M , Duster hot t, 
Errrrer son, P. T. , Ent i an, K. D. , Er r i ngt on, J. , Fabret, 
Foulger.D. , Fritz.C. , Fuj i t a, M , Fuj i t a, Y. , Fuma, S. , Gal i zzi , A. , 
Galleron.N. , Ghi m S. Y. , Gaser.P. , Goffeau.A. , Gol i ght I y, E. J. , 
G-andi,G, Oii seppi , G , Qjy, B. J. , Haga, K. , Haiech.J., Har wood, C. R. , 
Henaut , A. , Hi I bert , H. , Hoi sappel , S. , Hosono, S. , Hul I o, M F. , 
1 1 aya, M , Jones, L. , Joris.B. , Kar arrat a, D. , Kasahar a, Y. , 
Kl aerr- Bl anchard, M , Kl ei n, C. , Kobayashi , Y. , Koet t er , P. , 
Koni ngst ei n, G. , Krogh.S. , Kumano, M , Kurita, K. , Lapi dus, A. , 
Lardi noi s, S. , Lauber.J., Lazar evi c, V. , Lee, S. M , Levi ne, A. , Li u, H. , 
^suda, S. , ^uel , C. , Msdi gue, C. , Fvedi na, N. , rvel I ado, R. P. , 
M zuno, M , IVbest I , D. , Nakai , S. , Noback, M , Noone, D. , 0 Rei I I y, M , 
Cgawa, K. , Ogi war a, A. , Oudega, B. , Par k , S. H. , Parro.V. , Pohl , T. M , 
Port et el I e, D. , Por wol I i k, S. , Pr escot t , A. M , Presecan.E. , Puj i c, P. , 
Pur nel I e, B. , Rapoport.G. , Rey, M , Reynolds, S. , R eger , M , 
R volt a, C. , Rocha, E. , Roche, B. , Rose, M , Sadai e, Y. , Sato.T. , 
Scanl an, E. , Schl ei ch, S. , Schr oet er , R. , Scot f one, F. , Seki guchi , J. , 
Sekowska, A. , Ser or , S. J. , Serror,P. , Shi n, B. S. , Sol do, B. , 
Sorokin.A. , Tacconi , E. , Takagi , T. , Takahashi , H. , Takerrar u, K. , 
Takeuchi , M , Tarrakoshi , A. , Tanaka, T. , Terpstra, P. , Tognoni , A. , 
Tosato,V. , Uchi yarra, S. , Vandenbol , M , Vannier,F. , Vassar ot t i , A. , 
Viari.A. , Varrbutt,R , V\edl er , E. , Wedl er , H. , Wei t zenegger , T. , 
Wnters.P. , W pat , A. , Yarramot o, H. , Yarrane, K. , Yasurrot o, K. , Yat a, K. , 
Yoshi da, K. , Yoshi kawa, H. F. , Zurrst ei n, E. , Yoshi kawa, H. and 
Danchi n, A. 

The complete genome sequence of the gram-positive bacterium

Bacillus subtilis

Nature 390 (6657), 249-256 (1997)

9384377

2 (bases 1 to 204537)

Kunst,F., Ogasawara,N., Yoshikawa,H. and Danchin,A.
Direct Submission

Submitted (27-JUN-2003) I. Moszer, A. Danchin, Institut Pasteur,
Genetique des Genomes Bacteriens, 28 rue du Docteur Roux, 75724
Paris Cedex 15, FRANCE. E-mail: moszer@pasteur.fr,
adanchin@pasteur.fr Phone: +33 (0)1 45 68 84 41, Fax: +33 (0)1 45
68 89 48

On Jul 7, 2003 this sequence version replaced gi:2634723.
This entry contains data from release R16.1 of the SubtiList
database. Further data on gene annotation and detailed information
about changes from previous releases can be found at
http://genolist.pasteur.fr/SubtiList/.
FEATURES             Location/Qualifiers
     source          1..204537
                     /organism="Bacillus subtilis subsp. subtilis str. 168"
                     /mol_type="genomic DNA"
                     /strain="168"
                     /db_xref="taxon:224308"
     gene            complement(61..633)
                     /gene="ypaA"
                     /locus_tag="BSU23050"
     CDS             complement(61..633)
                     /gene="ypaA"
                     /locus_tag="BSU23050"
                     /function="unknown"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="CAB14237.1"
                     /db_xref="GI:2634740"
                     /db_xref="GOA:P50726"
                     /db_xref="SubtiList:BG11428"
                     /db_xref="UniProtKB/Swiss-Prot:P50726"

/ 1 ransl at i on=" M<VKKLVWSM_SSI AFVLrVLLNFPFPGLPDYLKI DFSDVPAI I 

AI LI YGPLAQ AVEAI KNVLGYI I QGSM^GVPVGCVANFI AGTLFI LPTAFLFKKLNS 

AKGLAVSLLLGTAArvTTI L^l LNYVLI L PAYTV\FL HSPALSDSAL KTAWAGI LPFNM 

I KQ VI TWFSLI Fl KLKPW EQQRSAHI H" 
     gene            complement(739..903)
                     /gene="ypzE"
                     /locus_tag="BSU23060"
     CDS             complement(739..903)
                     /gene="ypzE"
                     /locus_tag="BSU23060"
                     /function="unknown"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="CAB14238.1"
                     /db_xref="GI:2634741"
                     /db_xref="SubtiList:BG13767"
                     /db_xref="UniProtKB/Swiss-Prot:O32010"

/ 1 ransl at i on=" IVKFPTGGDEPIVALSPRAVFTAGFGEI PEPTVGSQAEK^VHKRF 

El EFFKRFFA" 
     gene            1130..2707
                     /gene="serA"
                     /locus_tag="BSU23070"
     CDS             1130..2707
                     /gene="serA"
                     /locus_tag="BSU23070"
                     /EC_number="1.1.1.95"
                     /function="serine biosynthesis"
                     /codon_start=1
                     /transl_table=11
                     /product="phosphoglycerate dehydrogenase"
                     /protein_id="CAB14239.1"
                     /db_xref="GI:2634742"
                     /db_xref="GOA:P35136"
                     /db_xref="InterPro:IPR002912"
                     /db_xref="InterPro:IPR006139"
                     /db_xref="InterPro:IPR006140"
                     /db_xref="InterPro:IPR006236"
                     /db_xref="SubtiList:BG10509"
                     /db_xref="UniProtKB/Swiss-Prot:P35136"

/ 1 ransl at i on=" IVFRVLVSDKIVSNDGLQPLI ESDFI El VCKNVADAEDEL HTF DAL 
L VRSAT KVT EDL FNKMTSL Kl VGRAGVGVDNI Dl DEATKHGVI VI NAPNGNTI STAEH 
TFAM SSL^HI PCANI SVKSREWJRTAYVGSELYGKTLQ VGLGRI GSEI AQRRGAF 
QvTTVHVFDPFLTEERAKKI GVNSRTFEEVL ESADI I TVHTPLTKETKGL L NKETI AKT 
KKGVRL I NCARGG I DEAAL L EAL ENGHVAGAAL DVF EVEP PVDNKL VDHPL VI ATPH 
L GASTKEAQL NVAAGVSEEVLGFAKGL PVIVBAI NLPAMTKDEFAKI KPYHQ AGKI GS
LVSQOVKEPVQDVAI QYEGTI AKLETSFI TKAL LSGFL KPRVDSTVNEVNAGGVAKER

Gl SFSEKI SSSESGYDNQ SVKVTGDRSTFTVTATYI PHFGERI VEI NGFNI DFYPTG 

HLVYI QHQDTTGVI GRVGRI LGDNDI Nl ATMCA/GRKEKG3EAI MVLSFDRHLEDKI VK 

ELTNVPDI VSVKLI DLP" 

     gene            complement(2750..3517)
                     /gene="aroC"
                     /locus_tag="BSU23080"
     CDS             complement(2750..3517)
                     /gene="aroC"
                     /locus_tag="BSU23080"
                     /EC_number="4.2.1.10"
                     /function="shikimate pathway"
                     /codon_start=1
                     /transl_table=11
                     /product="3-dehydroquinate dehydratase"
                     /protein_id="CAB14240.1"
                     /db_xref="GI:2634743"
                     /db_xref="GOA:P35146"
                     /db_xref="InterPro:IPR001381"
                     /db_xref="SubtiList:BG10538"
                     /db_xref="UniProtKB/Swiss-Prot:P35146"

/ 1 ransl at i on=" MWLTI KGVSI GEGVPKI I I PLM3KTEKQ LNEAEAVKLLNPDI 

VEWMDVFEKANDREAVTKLI SKLRKSLEDKLFLFTFRTHKEGGS^MDESSYLALLE 

SAI QTKDI DLI Dl ELFSGDAIWKALVSLAEENNVYWIVBNHDFEKTPVKDEI I SRLRK 

M3DLGMHI PKIVAVIVPNDTGDLLTLLDATYTIVKTI YADRPI I TM3M\ATGL I SRLSGEV 

FG3ACTFGAGEEASAPGQ PVSELRSVLDI LHKKfTRG' 

     gene            complement(3629..4735)
                     /gene="rsiX"
                     /locus_tag="BSU23090"
     CDS             complement(3629..4735)
                     /gene="rsiX"
                     /locus_tag="BSU23090"
                     /function="negative regulation of sigma-X activity"
                     /note="alternate gene name: ypuN"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="CAB14241.1"
                     /db_xref="GI:2634744"
                     /db_xref="GOA:P35166"
                     /db_xref="SubtiList:BG10537"
                     /db_xref="UniProtKB/Swiss-Prot:P35166"

/ 1 ransl at i on=" MVKSEWJEEQ KELLSQLPAVKDHRSPCDI YKRLTIVAKRKNKPA 
VRW GPAGAAAI AVYI AFI I SPHFFDCACPCDKEASQENAVTKTETEDSPKAASSLDQ 
TSF WP EKEQDNYI TVAVADADTSAI I PVSI QKTNADQTI QDM_FESSELQ LDHAI T 
I PTFI DEVEI KEKPKQKELSI RVHQPATAFSI KDDTL L KKL L KESL KWSPYEKVKFLS 
DQNETGVRI GSYGTFTEI SI PKCSKRSYYLYCMKQ3QDFLVPSNHSFDTVKEAI KE^ 
SSSGEDTTPLI CAGAVQSVTKKQKHLYIVRFSKESEVDDSI AQ LM EGLLLTAKEFGF 
TEVTFTETRTKKI GKYDI SDAI PVPAAPNPI SLIM' 
     gene            complement(4671..5255)
                     /gene="sigX"
                     /locus_tag="BSU23100"
     CDS             complement(4671..5255)
                     /gene="sigX"
                     /locus_tag="BSU23100"
                     /function="may participate in
                     peptidoglycan synthesis and tu
                     /note="alternate gene name: ypuM"
                     /codon_start=1
                     /transl_table=11
                     /product="RNA polymerase ECF(extracytoplasmic
                     function)-type sigma factor (sigma-X)"
                     /protein_id="CAB14242.1"
                     /db_xref="GI:2634745"
                     /db_xref="GOA:P35165"
                     /db_xref="InterPro:IPR000838"
                     /db_xref="InterPro:IPR007627"
                     /db_xref="InterPro:IPR007630"
                     /db_xref="InterPro:IPR013324"
                     /db_xref="InterPro:IPR013325"
                     /db_xref="SubtiList:BG10536"
                     /db_xref="UniProtKB/Swiss-Prot:P35165"

/ 1 ransl at i on=" IVEETFQL L YDT YHQDL YQF L F YIWKDKNQT EDL L QEVYI RVLNS 
YHTFEGRSSEKTW. LSI ARHVAI DV\FRKGQTI RQRI LGTFDWDTQDVRDQQLLPDELA 
VQHENVREI SCALDCCTI DQRAVI I LRFI QGYSI QETAKAL RFSESKVKTTQHRGL KV 
L RKHIVEL L REELMDDEVRIVERRTDKGWKSTSGS" 
     gene            complement(5459..7228)
                     /gene="resE"
                     /locus_tag="BSU23110"

Query Match 45.7%; Score 345.6; DB 14; Length 204537;
Best Local Similarity 66.5%; Pred. No. 4.9e-99;
Matches 495; Conservative 0; Mismatches 249; Indels 0; Gaps 0;



Gy 10 G^TATCAAAMTGCGAGTCMCCATTTCTGAGCAATGAOGMGTGAAAGATTTGATAGCC 69 

III I I I I I I I I I I I I I III I I I I I I I I I I I I 
Db 34223 GTTMGAAAMCGGGAAAMCGCTGAGCTGAAGGATCATGAAGTAMGGAATTAATCAAA 34164 

Gy 70 MG^GCCMGCTGGOSATACGGATGCACGJGAGCTTCTOGTGAATAGCAATATCAGACTG 129 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I III 
Db 34163 CAMGCCAAMTGGOSACCAGGAGGGMGAGACCTCCTCATAGAAAAAMCATQCGTCT 34104 

Gy 130 GTCTGGTCCGJCGn"CCAGCGCTTTATGMOG3CG3GJATGAAG03GATGAT 189 

Db 34103 

Gy 1 90 ATCG3TTGCATTGQGTTGCTCMGQ0CGTTG^CMGTTO3ATCTTTO3TAC^ 249 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 34043 ATCG3CTGCATCG3XTGnTAAMTCTGTTG^ 33984 

Gy 250 TTTTCX^CCTATGCQGn-GCCMTGATCATC«^^ 309 

I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I III I II I I I I I I 
Db 33983 TTTTCAAGGTATQCAG^~GCXIX^TGATTATCQ3AGAMTCCAACG^TTTATCCG^GATGAC 33924 

Gy 310 QGn~AGGSTTAAQGn~CAGrCGATCGTTAAMGAMCAGCGAATA 369 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 33923 Q3AAGCGTAAAQGn~ATGA03Gn"CATTAAM 33864 

Gy 370 GATGAATTGTACAAQCMTTCQ3CG3jrGCOGCX^ 429 

Db 33863 GATGAQCTTTCGAAMCACT 33804 

Gy 430 G3MTCACX3GO33AG3MGTAGTCTTTG30CAAGAG3GAAG^ 489 

Db 33803 GAGATTGAAQCTGAG3ATGTTG^ 33744 

Gy 490 CATGAGACCGTTTTTGAAMTGACX33CGATCX^TGA^ 549 

Db 33743 CAQ^QOG^ 33684 

Gy 550 GAAG3TGTGMCAAGTGGn"TTGAGAAMTTQCCTTGAAG3ACQCCATCAGCAGGCTG^ 609 

I I I II MINIM MMMM MM II II MINI II 
Db 33683 MCTCAGAAGAAAMTGGnTTGACAAMTTGCGCTGAM 33624 

Gy 610 GAGO3n"GAQQAQGTCATCGTCTAGCTQ0GCTAT^ 669 

II III I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 33623 GAMGG3^AAAACTMTCGTCTATCTCAG^TATTATAMGACCAGACAGAG^"OCX^GGTG 33564 

Gy 670 GCAGAGOGn"CTAGG3^TTTG3IAGGTOCAGGTCT 729 

I I I I I I I I I I I I I I I I I I M I M M I M I I I I I I I I I II I I 
Db 33563 GCTGAGOGGCTGjGG^TCTCTCAGGTGGAGOTTTC 33504 

Gy 730 ACGATCAAGGAGCAAATTGAACAT 753 

MMMM Mill II III 
Db 33503 GAGATGAAGGTTCAAATGGATCAT 33480 



RESULT 12 
BACJH642
LOCUS

DEFINITION
ACCESSION
VERSION
KEYWORDS

SOURCE
ORGANISM

REFERENCE
AUTHORS

JOURNAL
PUBMED
REFERENCE
AUTHORS
TITLE
JOURNAL



FEATURES 

source 



BACJH642 282700 bp DNA linear BCT 25-MAR-2006
Bacillus subtilis DNA, 283 Kb region containing skin element.
D84432 D82370
D84432.1 GI:2627063



LysA; 
spol ' 
Yqkl 
YqkA; 
Yqj P; 
Yqj E; 
Brrr R; 
Yq . 
Spol VB; 
YqhX; - 
Spol I 
YqhP; 
Si nR; 
YqhC; 
YqgS; 
YqgH; 



RpoD; 
Yqf F: 
YqeU; 



Yqel 



YqdF, 
YqbS; 
Yqbl 
YqaR; 
YqaG; 
YrkQ; 
YrkF; 



SpoVAF; SpoVAE; SpoVAD; SpoVAC; SpoVAB; SpoVAA; Spol I AC; 



AB; Spol I AA; DacF; YqkQ YqkN; YqkM YqkL: 

AnsB; AnsA; AnsR; YqkH; YqkG; YqkF; YqkE; 

Yqj Z; Yqj Y; Yqj X; Yqj W Yqj V; " 

Yqj O; Yqj N; Yqj M; Yqj L; Yqj K; 

Yqj D; Yqj C; Yqj B; Yqj A; Yqi Z; 

Bf rrBB; Bf rrBAB; Bf rrBAA; Yqi V; 

Yqi 0; Yqi N; Yqi M; Yqi L; Yqi " 



RecN; AhrC; Yqi 



Yqi 



Spol I M YqkK; YqkJ; 
YqkD; YqkC; YqkB; 
Yqj U; Yqj T; Yqj S; Yqj R; Yqj Q; 
Yqj J; Yqj I ; Yqj H; Yqj G; Yqj F; 
Yqi Y; Yqi X; Yqi W Bnr U; Bnr ; 
Yqi U; Yqi T; Yqi S; Yqi R; Yqi Q 
Yqi J; Yqi I ; Yqi H; Yqi G; SpoOA; 



Yqi D; Yqi C; Yqi B; Yqi A; YqhZ; YqhY; 
YqhW Spol I I AH; Spol I IAQ Spol I I AF; Spol I I AE; Spol I I AD; 
I AC; Spol I I AB; Spol I I AA; YqhV; YqhU; YqhT; YqhS; YqhR; YqhQ; 



YqhO; YqhN; 

YqhF; YqhE; 

YqhB; YqhA; 

YqgR; YqgQ 

YqgG; YqgF; 



YqhM YqhL; YqhK; YqhJ; Yqhl ; YqhH; YqhQ Si nl 



Yqf W Yqf V; Yqf U; 



DnaE; Yqf M; 
Yqf E; Yqf D; 
YqeT; DnaJ; 



YqhD; 
YrkA; 
YqgP; 
YqgE; 
Yqf T; 
Yqf L; 
Yqf C; 
DnaK; 



CorrQG; CorrGE; Com3D; CorrGC; CorrGB; CorrOA; 



YqgZ; 
YqgQ 
YqgD; 
Yqf S; 
Yqf K; 
Yqf B; 
QpE; 



YqgY; 
YqgN; 
YqgC; 
Yqf R; 
Yqf J 
Yqf A; 
YqeS; 



YqgX; YqgW YqgV; YqgU; 

YqgM YqgL; " " ' 

YqgB; YqgA; 

Yqf Q Yqf P; 

Yqf I ; Yqf H; 

YqeZ; YqeY; 

YqeR; YqeQ 



YqgT; 

YqgK; YqgJ; Yqgl ; 
YqfZ; YqfY; Yqf X: 
YqfQ YqfN; CccA; 
Cdd; DgkA; YqfQ 
YqeX; YqeW YqeV; 
YqeP; Spol I P; Gpr 



YqeQ YqeN; ComEC; CorrEB; CorrEA; CorrED; YqeM YqeL; YqeK; YqeJ; 



AroD; YqeH; YqeQ YqeF; 



Spol VCB; Spol VCA; 



YqcM YqcL; 

YqdD; YqdQ 

YqbP; YqbQ 

YqbF; YqbE; 

YqaQ YqaN; 

YqaE; YqaD; 

YrkN; Yrklvt 

YrkQ YrkB; 



YqeE; YqeD; 

YqcK; YqcJ; 

YqcE; YqcD; 

YqbN; YqdB; 

YqbD; YqbQ 

YqaM YqaL; 
YqaQ 
YrkL; 
Bl t R; 



YqeQ 
Yqcl ; 
YqeQ 
YqbM 
YqbB; 
YqaK; 



YqaB; 
YrkK; 

Bl t ; Bl t D. 



YqeB; NucB; 

YqcH; YqcG; YqcF; 

YqcB; YqcA; YqbT; 

YqbL; YqbK; YqbJ; 

YqbA; YqaT; YqaS; 

YqaJ; Yqal ; YqaH; 
Spol I I C; YrkS; YrkR; 

YrkJ; Yrkl ; YrkH; YrkQ 



YqdE; Cwl A; 
YqbR; YqbQ 
YqbH; YqbQ 
YqaQ; YqaP; 
YqdA; YqaF; 
YrkP; YrkQ 
YrkE; YrkD; 
Bacillus subtilis
Bacillus subtilis

Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
1

Mizuno,M., Masuda,S., Takemaru,K., Hosono,S., Sato,T., Takeuchi,M.
and Kobayashi,Y.

Systematic sequencing of the 283 kb 210 degrees-232 degrees region
of the Bacillus subtilis genome containing the skin element and
many sporulation genes

Microbiology (Reading, Engl.) 142 (Pt 11), 3103-3111 (1996)
8969508

2 (bases 1 to 282700)
Sato,T.

Direct Submission

Submitted (16-APR-1996) Tsutomu Sato, Tokyo University of
Agriculture and Technology, Applied Biological Science; 3-5-8
Saiwai-cho, Fuchu, Tokyo 183, Japan (E-mail:subtilis@c.tuat.ac.jp,
Tel:81-423-67-5706, Fax:81-423-67-5715)

On or before Jan 7, 1998 this sequence version replaced gi:1303696,
gi:1212727.

                     Location/Qualifiers
                     1..282700
                     /organism="Bacillus subtilis"
                     /mol_type="genomic DNA"
                     /strain="JH642(trpC2 PheA1)"
                     /db_xref="taxon:1423"
     CDS             <1..654
                     /codon_start=1
                     /transl_table=11
                     /product="YrkA"
                     /protein_id="BAA12353.1"
                     /db_xref="GI:1303697"

/ 1 r ansl at i on=" VDLQVNGSI VAI SSEDSYETI VKI I KTESYTRYPVL NGDKDSI I 
GFI NAKEFLSAYI DTDQKI KEDFKL ENHI NPVI HVI ESVPI HDVLVKIVDKERTHI AI L 
VDEYGGTSGL VTAE DI LEEI VGEI RDEFDKDEVPNI RKVNDNHYI LDSKVLI EDVNDL
LGTTLASDEVDTI GQAFMTQQ DAAVGSVI EADGYI FKVHETVGRHI NYLEI VRKKE"
     CDS             complement(811..1269)
                     /codon_start=1
                     /transl_table=11
                     /product="BltD"
                     /protein_id="BAA12354.1"
                     /db_xref="GI:1303698"

/ 1 ransl at i on=" IV6I Nl KAVTDDNRAAI LDLHVSQMQLSYI ESTKVCL EDAKECHY 

YKPVGL YYEGDL VGFAIWGL FPEYDEDNKNGRVW. DRFFI DERYQ3KGL GKKIVL KAL I 

QHLAELYKCKRI YLSI FEMMI HAI RLYGRFGFGFISGEL DFNGEKVIWKEL" 
     RBS             complement(1278..1284)
     CDS             complement(1438..2640)
                     /codon_start=1
                     /transl_table=11
                     /product="Blt"
                     /protein_id="BAA12355.1"
                     /db_xref="GI:1303699"

/ 1 ransl at i on=" M<KSI NECKTI Fl I LLSNI FVAFLQ GLI I PV^SFM<I MHLSG 
STM3YLVAAFAI SOU TSPFAGRVWDRFGRKKM I LGLLI FSLSELI FGLGTHVSI FY 
FSRI LGGVSAAFI IVPAVTAYVADI TTL KERSKAfVGYVSAAl STGFI I GPGAGGFI AGF 
Gl RIVPFFFASAI ALI AAVTSVFI LKESLSI EERHQLSSHTKESNFI KDLKRSI HPVYF 
I AFI I VFVIVAFGLSAYETVFSL FSDHKFGFTPKDI AAI I Tl SSI VAWI QVLLFGKLV 
NKLGEKRM aCLI TGAI LAFVSTVM3GFLTVLLVTCFI F L AF DL L RPAL TAHL SNIVA 
GNCQGF VAGVNSTYTSL GNI FGPALG3 LFDLNI HYPFLFAGFVM VGLGLTIWV\KEK 
KNDAAALISf' 

     RBS             complement(2648..2654)
     RBS             2745..2752
     CDS             2757..3578
                     /codon_start=1
                     /transl_table=11
                     /product="BltR"
                     /protein_id="BAA12356.1"
                     /db_xref="GI:1303700"

/ 1 ransl at i on=" rVBEDVKKYFTTGEFSKLCRVKKQTLFHYDEI GLFSPEI KKENGY 
RYYSYHQFETFCVI SLFKELGVPLKEI KCLI KGKTPDKI LHVLKEKSI El DKKI NELK 
aOTI LQTKVTLTEQM ETDFSSI SFEYLNEETFIVLSRKTLNLPERKYVAAI SELI HE 
VCO/ELDEGYPI QGI FAREQ LEKDFYNYSYFYI KVKDGAENI NYHVRPKGLYAVGYE 
I QGNTEEAYRRI I EFI ERNGVQ GENAYEEYIVL DEIWVDGYEI\TTYAKI LLCVKEV" 

     RBS             3731..3736
     CDS             3748..3894
                     /codon_start=1
                     /transl_table=11
                     /product="YrkB"
                     /protein_id="BAA12357.1"
                     /db_xref="GI:1303701"

/ 1 ransl at i on=" IVLLKNWSRRI QRDKSKRAQ QGTNNRI PYTLLLCYVNVQKPFR 
I VDL" 

     RBS             4108..4111
     CDS             4120..4680
                     /codon_start=1
                     /transl_table=11
                     /product="YrkC"
                     /protein_id="BAA12358.1"
                     /db_xref="GI:1303702"

/ 1 ransl at i on=" IVhTNPFSPQCYYYVTVPIWNDGRSVYWTI PNEIVEKVHRGADL RS 

SYEDRNL L L KDYGP KPF WNI NRATKCNISrrFRTALWrGKHFGVTLIVBLQ GEDI GLEI 

HPNVDQFLRI EG3RQ VKM3KSKDHLNFQRNVYDDSAI WPAGTWNVI IMTGNrTPLKL 

YSI YAPPNHPFGTVHETKADAVAAED' 
     RBS             5458..5464
     CDS             5473..5664
                     /codon_start=1
                     /transl_table=11
                     /product="YrkD"
                     /protein_id="BAA12359.1"
                     /db_xref="GI:1303703"

/ 1 r ansl at i on=" MVEQ3KDCREWTQLAASRNAI DRAM3LI VSTNL EHCVRESL EK 

GEDTQNIL VKEAVDL LVKSR' 
     RBS             5797..5801
     CDS             5809..6291
                     /codon_start=1
                     /transl_table=11
                     /product="YrkE"
                     /protein_id="BAA12360.1"
                     /db_xref="GI:1303704"

/ 1 r ansl at i on=" MTEQTKmTI VL FSGDYDKAM*\AYI I ANGAAAYDHEVT I FhTTFW 

GFNALRKEELI PVKKGFLEKM^KMVPRGADKMSLSKMSIFAGVGPKM KNVIVKKHIWL 

TLPQLI EM^GEGGVKL VACTMTMDL LGLQEKEL L DDI DYAGVAAYL ADAEEGSVNL F I " 
     RBS             6464..6471
     CDS             6477..7034
                     /codon_start=1
                     /transl_table=11
                     /product="YrkF"
                     /protein_id="BAA12361.1"
                     /db_xref="GI:1303705"

/ 1 ransl at i on=" MvKATI VL DAKGL ACPIVPI VKTKKPJvKDLKAGEVLEI HATDKGS 

TADL EAWAKSTGHEYLGTEAEGEI L RHF L RKGGEHSSENASSI PEI SLEAFKGKVDSD 

ESLNI LDVREI EEYEKAHI PGWHI PLGEVEKRANELNENDEI Yl I CHSGRRSEMAAR 

TIVKKGGFKKVI NWPGVRDWTGKTE" 
     RBS             7361..7367
     CDS             7376..7486
                     /codon_start=1
                     /transl_table=11
                     /product="YrkG"
                     /protein_id="BAA12362.1"
                     /db_xref="GI:1303706"

/ 1 ransl at i on=" IVfrVKAMTPKEVTKKSLTKNL YL FV\M3VM<M LVI GR" 
     RBS             7597..7605
     CDS             7615..8502
                     /codon_start=1
                     /transl_table=11
                     /product="YrkH"
                     /protein_id="BAA12363.1"
                     /db_xref="GI:1303707"

/ 1 ransl at i on=" IWSEIVL SEAGHSVSYL KQGI\/KAV\8EHL EPVKVGDL KDGGEIVYQF 
VRI GKGCLSYIWI SNGEAAI I DATRIWNFFI DFAKGKGATI THVFDTHLHADHI SGGR 
VI AEKTKATYVMLPPKDAEEVEFDYQPLEEGNDI I I GNTTI Kl CPI YSPGHTI GSTSFI 
VDDQYLLSGDI LFI DSI GRP DL AGVAEDW/GDL RKT L YSRYKAL SKDL I VLPAHFM I 
DELNENGSVGKQLSTL FSENHGL Nl ADETEFRHLVTDHLPPQPNAYQEI RETNM3KI S 
PDEEKGREIVEI GPNRGAI R' 

     RBS             8523..8527
     CDS             8536..8763
                     /codon_start=1
                     /transl_table=11
                     /product="YrkI"
                     /protein_id="BAA12364.1"
                     /db_xref="GI:1303708"

/ 1 ransl at i on=" M<SDKVL DAKGL ACPVPI VRTKKAMMEL ESGQ LEVHATDKGAK 

NDLTAWSKSGGHDL L EQTDEGDI LKFW GKG' 
     RBS             8807..8815
     CDS             8824..9609
                     /codon_start=1
                     /transl_table=11
                     /product="YrkJ"
                     /protein_id="BAA12365.1"
                     /db_xref="GI:1303709"

/ 1 r ansl at i on=" MDI AFI TTLFI I GFI GAFI SGM Gl GGAVI NYPM.LYI PSLVGV 
M3LTAHEVSQ GAI GVFFATLGGW\AYRKSGLI NKTLI I YMG3SI LLGSVLGSYFSHY 
I SEKQ NFI YGI LAI I AVI LVFI PKKGQKQQEHSEDKEVI FNKW.ASSLAFI I GGVSG 
I LGAGGAFI LVPI M_SI LNI PVRVTVASSLAI TFLSSI GATVGKVI TGGVLFVPALVL 
M TSLI ASPI GASVGQKVNTKFLGVWL AOL I GATAI Kl WELL" 
     RBS             10177..10182

Query Match 45.7%; Score 345.6; DB 14; Length 282700;
Best Local Similarity 66.5%; Pred. No. 5.2e-99;
Matches 495; Conservative 0; Mismatches 249; Indels 0; Gaps 0;

Gy 10 GATATCAAAMTGXAGTCAACGATTTCTGACCMT 69 

I II Mill I I I I I I I I II I I I I I II II II II
Db 275466 GTTMG^AAMCQQGAAAMCXCTCAQCTGAAQ^TCATGMGn"AMQG^TTMTC^ 275525

Cy 70 M^QCGAAQ3TQQGGATACQ^TQGAa^G^Q0rTCTCGn"^TAQ3MTATCA^CTG 129 

Db 275526 GAAAQCGAAM 275585 

Cy 130 GrCTGCTCnjTCGn~(XAGCGCT^ 189 

II I I I I I I I I I I I I I I I III I III I II I I I I I I II III I II III 
Db 275586 GTTTQGTCTGrCGn"ACAQCQGn"TTTTAM(^GAQ3ATATGAQ3CT^CX^TCTCTTa^G 275645 

Cy 1 90 ATCQGn"TQGATTQQCTTQCTCMGQGCGTT^GAAGTTCGATCTTTCGn"ACX^TGrGAGA 249 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 275646 ATCQ3CTQGATCQQ33TGn"TAAMTCTGTr^CAMTTTGATTTMCXDTAT^TGrQCCT 275705 

Cy 250 TTTTCX^CCTATGCGGn"G(XAATGATCA^ 309 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I II I I I I I I 
Db 275706 TTTTCMCGTATQGAGn"QCCGATGATTATa^G^MTCCMa^TTTATCmr^T^C 275765 

Cy 310 GGn"AQ3CTTMQGn"CACn~CGOT^ 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 275766 Q3MCCGTAMQGn7\TGACam^TTAAM 275825 

Cy 370 GATGAATTGTACMQCMTTCOB^^ 429 

Db 275826 G^QCTTTCGAA^^ 275885 

CV 430 Q3MTCACQ0CQ3AQ3MGTAGrc^ 489 

Db 275886 UgATTGAAQCTGAQ3ATCTTCTACT 275945 

CV 490 CATGAGACGGTTTTTGAAMTGACmDGAT 549 

II II I I I I I I I I I I I I I I I I I I I I I I I I II II III I I I I I I II II II 

Db 275946 GACCWV\CmTTTATGAAAAT(^CC^ 276005 

cy 550 c^asn"GTC^(^Gra3nTT^ 6 09 

Db 276006 AACTCAGMGAAAMTG^^ 276065 

Cy 610 G^QCGTG^QGAQCTC^TajrCT^^ 669 

Db 276066 276125 

Cy 670 QCAGAQCC^CTAGQ3^TTTCCiCAGGTC^^ 729 

Cy 730 ACGATCAAQ3^QCAAATTGAACAT 753 

Db 276186 CAG^TCAAC<^TCAAATai\TCAT 276209 



RESULT 13 

BA000004_16 

COMMENT

Sequence split into 42 fragments LOCUS BA000004 Accession BA000004



Fragment Name     Begin       End
BA000004_00           1    110000
BA000004_01      100001    210000
BA000004_02      200001    310000
BA000004_03      300001    410000
BA000004_04      400001    510000
BA000004_05      500001    610000
BA000004_06      600001    710000
BA000004_07      700001    810000
BA000004_08      800001    910000
BA000004_09      900001   1010000
BA000004_10     1000001   1110000
BA000004_11     1100001   1210000
BA000004_12     1200001   1310000
BA000004_13     1300001   1410000
BA000004_14     1400001   1510000
BA000004_15     1500001   1610000
BA000004_16     1600001   1710000
BA000004_17     1700001   1810000
BA000004_18     1800001   1910000
BA000004_19     1900001   2010000
BA000004_20     2000001   2110000
BA000004_21     2100001   2210000
BA000004_22     2200001   2310000
BA000004_23     2300001   2410000
BA000004_24     2400001   2510000
BA000004_25     2500001   2610000
BA000004_26     2600001   2710000
BA000004_27     2700001   2810000
BA000004_28     2800001   2910000
BA000004_29     2900001   3010000
BA000004_30     3000001   3110000
BA000004_31     3100001   3210000
BA000004_32     3200001   3310000
BA000004_33     3300001   3410000
BA000004_34     3400001   3510000
BA000004_35     3500001   3610000
BA000004_36     3600001   3710000
BA000004_37     3700001   3810000
BA000004_38     3800001   3910000
BA000004_39     3900001   4010000
BA000004_40     4000001   4110000
BA000004_41     4100001   4202352



Continuation (17 of 42) of BA000004 from base 1600001 (BA000004 Bacillus halodurans C-

Query Match 45.4%; Score 343; DB 14; Length 110000;
Best Local Similarity 67.3%; Pred. No. 3.1e-98;
Matches 484; Conservative 0; Mismatches 235; Indels 0; Gaps 0;



Qy 38 TGAGCAATGACCAAGTGAAAGATTTGATAGTC^ 97
Db 14367 TTACAGACX^QGAAATTAAGCAGCTCA 14426

Qy 98 GTG^GCTTCTCGTG^TAQGMTAT 157
Db 14427 GAGATCAAATCGTGAGTCCT 14486

Qy 158 A(XQCQ3Gn"ATG^AQCQ3ATGA^ 217
I I I I I I I I I I I I I I I II II I I I I I I I II I I I I I I I I I I I I I I I I II I I 
Db 14487 ATCQCO^ATGAAQCAGACGACCTGTTO 14546

Qy 218 TTG^CMGnTCG^TCTTTCGTACGATGnX^^ 277
I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14547 TGGATAMTTCGATTTGTCGTACGATGTGAAGnTTTCCAC^ 14606

Qy 278 TCX33AG*W\TTCAACGCTTTTTQ^ 337
I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 14607 TCG3TGAMTCGAACGATTCCTTCQCGATGATGGTACG3TAM 14666

Qy 338 MGAMCAGCGAATMQGn"QCQ3CGATCAMGGA^ 397
III I I I I I I I I I I I I I I I I I I II III I I I I I I 
Db 14667 AGGAGCTTGGCMTAAMTTCGGAAGCTCy\AG^ 14726

Qy 398 CCCCCACGATCX£AGAAGn-Q3CAGM 457
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14727 CGCCGACGGTGAATGAMTTQCGSAGC^^ 14786

Qy 458 CGCAAG^GGGAAGCAGAGCQCCTTCCTCCATCCATG^ 517
Db 14787 ^1^^^^^^^^^ 14846

Qy 518 ATCCCATCACACTGOTCGATCAGATAGOG3ArGMG3TCT 577
lllllll II II I Mill II III II I I II I MINIM II I 
Db 14847 ATOOCATTAQOCTTTTG^^ 14906

Qy 578 TTGGCTTGMGGACGCCATCAQGAC^^ 637
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 14907 TCGCCTTAAMGAAGOC^TTCATMCT^ 14966

Qy 638 GCTATTACMGG^GA^GACAGT^^ 697
I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 14967 CTTATTATAMG^CAMCODMTCCGAC^ 15026

Qy 698 AGGTCTCQX^CTQSAAMQCGTATCCT 756
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 15027 MGTATCGAC^CTTGAAMGAAMT(XT^^ 15085



RESULT 14 
CP000764_28/c
COMMENT

Sequence split into 41 fragments LOCUS CP000764 Accession CP000764



Fragment Name     Begin       End
CP000764_00           1    110000
CP000764_01      100001    210000
CP000764_02      200001    310000
CP000764_03      300001    410000
CP000764_04      400001    510000
CP000764_05      500001    610000
CP000764_06      600001    710000
CP000764_07      700001    810000
CP000764_08      800001    910000
CP000764_09      900001   1010000
CP000764_10     1000001   1110000
CP000764_11     1100001   1210000
CP000764_12     1200001   1310000
CP000764_13     1300001   1410000
CP000764_14     1400001   1510000
CP000764_15     1500001   1610000
CP000764_16     1600001   1710000
CP000764_17     1700001   1810000
CP000764_18     1800001   1910000
CP000764_19     1900001   2010000
CP000764_20     2000001   2110000
CP000764_21     2100001   2210000
CP000764_22     2200001   2310000
CP000764_23     2300001   2410000
CP000764_24     2400001   2510000
CP000764_25     2500001   2610000
CP000764_26     2600001   2710000
CP000764_27     2700001   2810000
CP000764_28     2800001   2910000
CP000764_29     2900001   3010000
CP000764_30     3000001   3110000
CP000764_31     3100001   3210000
CP000764_32     3200001   3310000
CP000764_33     3300001   3410000
CP000764_34     3400001   3510000
CP000764_35     3500001   3610000
CP000764_36     3600001   3710000
CP000764_37     3700001   3810000
CP000764_38     3800001   3910000
CP000764_39     3900001   4010000
CP000764_40     4000001   4087024
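
The fragment boundaries for CP000764 follow a regular pattern: each fragment starts 100000 bases after the previous one and spans 110000 bases, so neighbouring fragments overlap by 10000 bases, and the last fragment is cut at the end of the sequence (4087024). The sketch below reproduces that pattern. The step and overlap values are read off the listing, while the function name and the `ACCESSION_NN` naming scheme are assumptions made for illustration; this is not GenCore's own splitting code.

```python
def fragment_table(accession, total_length, step=100_000, overlap=10_000):
    """Return (name, begin, end) rows for overlapping fragments.

    Hypothetical helper: step and overlap are inferred from the listing,
    and the two-digit suffix naming is an assumption.
    """
    rows = []
    index = 0
    begin = 1
    while True:
        # Each fragment covers step + overlap bases, capped at the sequence end.
        end = min(begin + step + overlap - 1, total_length)
        rows.append((f"{accession}_{index:02d}", begin, end))
        if end == total_length:
            break  # the fragment that reaches the sequence end is the last one
        begin += step
        index += 1
    return rows


if __name__ == "__main__":
    for name, begin, end in fragment_table("CP000764", 4087024):
        print(f"{name:<12}{begin:>10}{end:>10}")
```

Stopping as soon as a fragment reaches the end of the sequence, rather than whenever the next start position would still be in range, is what keeps the count at 41 fragments here.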



Continuation (29 of 41) of CP000764 from base 2800001 (CP000764 Bacillus cereus subsp.

  Query Match             45.2%;  Score 342;  DB 14;  Length 110000;
  Best Local Similarity   67.6%;  Pred. No. 6.5e-98;
  Matches  480;  Conservative    0;  Mismatches  230;  Indels    0;  Gaps    0;

Qy 47 ACCMGTGAMG^TTG^AQCCA^^ 106
II I I I I I I II II II II I I I I III III I II III 
Db 53779 ACX^GnTAAMGSCTTMTTGAAAA^ 53720

Qy 107 TCX^GMTAGSMTATGAG^TQG^^ 166
III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 53719 TTCTTGAMCTMTATG3GGCTTGn7\T^ 53660

Qy 167 ATGAAGGGGOTG^TTGTTTCAGOTC^ 226
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I 
Db 53659 ATGAACCC^CX^CTTATTTCAMTTGQCTCT 53600

Qy 227 TCX^TCTTTCGn"ACGATGn"^^TTTTCGftGCTATQ3Q3TQXMT^TGATCQ^GAM 286
I III I II I I I I I I I I I I I I I II II II II I I I I I I I I II II I I I I 
Db 53599 TT^TTTATCTTTT^TGn"GAAGTTTT(W\CTTACX33AGrTCCAAT^TTATTQGrGAAA 53540

Qy 287 TTCAACGCTTTTTQCGCGATGACG^ 346
I I I I I I I I I I I I II I I I I I I I I I I II II II II I I I I I I I I I I I I I I I I I I 
Db 53539 TTGAACGTTTTTTACGn"GATGACGGTTCAGTAAMGrG^QCCGATCGrTAAMGAM^ 53480

Qy 347 CGMTMGCTGCGQCGATCAAAQ3ATGA 406
I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I 
Db 53479 GTMTMGATTCGAMG^GAMGATGAQCTTTCGAM 53420

Qy 407 TCQGAGA^G^QQCAGAAQGAG^Q3GA^TCAam^Q3^AG^AGTCTTTQCQ3AAGAQG 466
Db 53419 WMT^T^^U<i^lMA^T^TcVm^^ 53360

Qy 467 CAAC£AGAOmDTTCCTC€ATCCATG^^ 526
I I I I I I I I I II II II I I I I I II II I I I I I I I I I I II I I I I I I I I I 
Db 53359 CAAGTCGTACC^XIATCTTCTATACATGAMCTCT 53300

Qy 527 CACTCV\TC^TCAC^TACm3ATCV\AC^GT^ 586
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 53299 CC^TTTTGGATCAMTTC<XI3C^TCMGO^ 53240

Qy 587 AGG^CQCCATCAGCAGQCTC^QCC^^ 646
I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 53239 AA(^QCW\TAAGA(^CTAGATGAQ^ 53180

Qy 647 A«3ATCAGAGACAGrCTGAffiTAGCAGAQ^ 706
Db 53179 kA^^^k^UUWy^VW^G^^ 53120

Qy 707 GrCTGGAAMGCGn7\TCCTQCTM 756

Db 53119 a^T«3AAAAC^^ 



RESULT 15 

X63757 

LOCUS       X63757                  1920 bp    DNA     linear   BCT 18-APR-2005
DEFINITION  B. megaterium genes spoIIAA, spoIIAB, spoIIAC, pbp and spoVA.
ACCESSION   X63757 S46395
VERSION     X63757.1  GI:39650
KEYWORDS    pbp gene; spoIIAA gene; spoIIAB gene; spoIIAC gene; spoVA gene.
SOURCE      Bacillus megaterium
  ORGANISM  Bacillus megaterium
            Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus.
REFERENCE   1
  AUTHORS   Tao,Y.P., Hudspeth,D.S. and Vary,P.S.
  TITLE     Cloning and sequencing of the Bacillus megaterium spoIIA operon
  JOURNAL   Biochimie 74 (7-8), 695-704 (1992)
   PUBMED   1391049
REFERENCE   2  (bases 1 to 1920)
  AUTHORS   Vary,P.S.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JAN-1992) P.S. Vary, Northern Illinois University,
            Dept of Biological Sciences, DeKalb, Illinois 60115-2861, USA
COMMENT     On Jun 23, 2005 this sequence version replaced gi:258183.
FEATURES             Location/Qualifiers
     source          1..1920
                     /organism="Bacillus megaterium"
                     /mol_type="genomic DNA"
                     /strain="PV361"
                     /db_xref="taxon:1404"
                     /map="fragment E, on 339kb NotI fragment"
                     /clone="pYP4, pYP5, pYP9"
     gene            <1..77
                     /gene="pbp"
     CDS             <1..77
                     /gene="pbp"
                     /codon_start=3
                     /transl_table=11
                     /protein_id="CAA45285.1"
                     /db_xref="GI:39651"
                     /db_xref="UniProtKB/TrEMBL:Q45319"
                     /translation="YRCCFLVEDI OTYI LYFLSNI LNS"
     misc_signal     81..87
                     /note="CA-box"
     gene            110..141
                     /gene="spoIIAA"
     -35_signal      110..118
                     /gene="spoIIAA"
     -10_signal      134..141
                     /gene="spoIIAA"
     gene            159..164
                     /gene="spoIIAA and spoIIAB"
     RBS             159..164
                     /gene="spoIIAA and spoIIAB"
     gene            172..522
                     /gene="spoIIAA"
     CDS             172..522
                     /gene="spoIIAA"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="CAA45286.1"
                     /db_xref="GI:580788"
                     /db_xref="GOA:P35147"
                     /db_xref="InterPro:IPR002645"
                     /db_xref="InterPro:IPR003658"
                     /db_xref="UniProtKB/Swiss-Prot:P35147"
                     /translation="IVBLSI ELEFKQDVL LVRLTGEL DHHTAEEL RSKI TEAI EKESI S
                     HLI LNLCHLTFIVDSSGLGVI L GRYKGVHNNG3EIWVCAI SPAI ERLFNIVBGLFKI I RF
                     EPNEANALQKLGVA"
     gene            522..965
                     /gene="spoIIAB"
     CDS             522..965
                     /gene="spoIIAB"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="CAA45287.1"
                     /db_xref="GI:39653"
                     /db_xref="GOA:P35148"
                     /db_xref="InterPro:IPR003594"
                     /db_xref="InterPro:IPR010194"
                     /db_xref="UniProtKB/Swiss-Prot:P35148"
                     /translation="IVKNGMMLGFSAL SQNESF ARVT VASF I TQL DPTMDELTEI KTW
                     SEAVTNAI I HGYESNSDGWYI SVT L HEDGWEL M RDEQ Q DNVDEAKQPLYTTKP
                     DLERSGVGFTI IVENFMDEI RVESTVL EGTTL YL KKHLTNSKALCISf'
     gene            join(965..969,978..1739)
                     /gene="spoIIAC"
     RBS             965..969
                     /gene="spoIIAC"
     CDS             978..1739
                     /gene="spoIIAC"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="CAA45288.1"
                     /db_xref="GI:39654"
                     /db_xref="GOA:P35145"
                     /db_xref="InterPro:IPR000943"
                     /db_xref="InterPro:IPR007624"
                     /db_xref="InterPro:IPR007627"
                     /db_xref="InterPro:IPR007630"
                     /db_xref="InterPro:IPR013324"
                     /db_xref="InterPro:IPR013325"
                     /db_xref="UniProtKB/Swiss-Prot:P35145"
                     /translation="MDVEVKKNKNEPYL KDHEVKDL I KRSQGGDQ ARDTI VQKNIVRL
                     VW6WQRFI NRGYEPDDLFQi GCI GL L KSVDKF DLSYDVKFSTYAVPM I GEI QRFI R
                     DDGTVKVSRSL KEIVBNKI RKAKDELSKLLGRVPTVAEVAEHLDLTPEEWLAQEANRA
                     PSSI HETVYENDGDPI TLLDQ ADHTEAKV\FDKI ALKEAI EELDEREKLI VYLRYYKD
                     QTCSEVAARLGI SQVGVSRLEKKI L KGVKL HMMDT"
     terminator      1750..1767
     terminator      1772..1789
     misc_signal     1801..1820
                     /note="20nt-box"
     -35_signal      1829..1835
                     /note="spoVA gene"
     -10_signal      1853..1860
                     /note="spoVA gene"
     RBS             1885..1890
                     /note="spoVA gene"
     gene            1899..>1920
                     /gene="spoVA"
     CDS             1899..>1920
                     /gene="spoVA"
                     /codon_start=1
                     /transl_table=11
                     /protein_id="CAA45289.1"
                     /db_xref="GI:580789"
                     /translation="IVENVLYL"



  Query Match             44.9%;  Score 339.6;  DB 14;  Length 1920;
  Best Local Similarity   66.3%;  Pred. No. 2e-97;
  Matches  489;  Conservative    0;  Mismatches  249;  Indels    0;  Gaps    0;



Qy 16 AAAMTGCGACTCMCCATTTCTGACCAATGACCM 75
Db 996 AAAAATAAAAACGAACCGTATTTAM 1055

Qy 76 CAAGCTG3CGATACGGATGCACGTGAGCTTCT 135
Db 1056 CAQCMGGTGATCAMTTGC^^ 1115

Qy 136 TCCGTCGTCCAGCGGTTTATCMCCGCG33TATGAAQCG3ATGATTTGTT^ 195
Db 1116 v^T^MWkk^^^iiMy^wi^ 1175

Qy 196 TGCATTQ3CTTGGTGAAGQCCGTTGACMGTTCGATCTT^ 255
II II II I I III IIIIMI II II II I Mill Mill I MUM 
Db 1176 TGn"ATAGGGCTATTAAAGn"CCGTTGATAMTTTG^CTTATCGTATGATGTTAAATTTTCA 1235

Qy 256 ACCTATGCGGTGCXIAATGATCATCGG^GAAATT 315
II I I I I I II I I I I I I I I II II I I I I I II I II III I II I I I I I II III 
Db 1236 ACATATGCTGTTCXAATGATTATTGGTGAAATTCAGOG^TTTATTCGTGATGATGGCACG 1295

Qy 316 GTTMG3TGAGTCX^TCGTTAAMG4MCAGCGW 375
I I I I I I I I I I I I I I I I I I I II I II II III I I I I I I I I I 
Db 1296 GTAMG3TTAGCCGTTCTCTGAMGAMTGAQC^ 1355

Qy 376 TTGTAGMGGAATTCGGCCGTGCCCCCACGATC^ 435
I I I I I I I I I II I II I I I II I I I I I I I I I I I I III II 
Db 1356 CTTTCCAMTTACTTGG^CQCX^C^ 1415

Qy 436 ACGCCGGAGGAAGTAGTCTTTGCGCMGAGGCMGCAGA& 495
Db 1416 ACTCCTGAAGAAGTAGTGCTAGCT 1475

Qy 496 AOX3riTTTOW\ATGA0Gr^^ 555
1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II II I II II II II III I 
Db 1476 ACm^CTATG^AMTGATQ3^G^TCCMTTACa:TTCTTGACCAAATTQa^G^TGATACX3 1535

Qy 556 CTGAAGMCTQCTTTGAGAAMTTQXTTGAAQ^CQCGATGAQ3AQ3CTGAQ3^Q^ 615
Db 1536 GAAG3G^CTGCTTT 1595

Qy 616 ^QGAQarGATCGn"CTACCTQCQCTATTACAAQ3^TCAG^GACAGn"CT^QGTAQGA^G 675
II I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I II II II I 
Db 1596 GAAMQGTGATTGTCTATTTGAQGfrATTACAMG^TCAMCTCACTCCG^CTQQGCQ3A 1655

Qy 676 CCTCTAa^ATTTOQCAGGTCCAGGTCTa^ 735
Db 1656 AC^OaC^TTTCGCAC^ 1715

Qy 736 AAQC^QCAAATTGAACAT 753
Db 1716 AACCTQ3ATATCV\ATGAT 1733

Search completed: November 4, 2008, 17:36:32
Job time : 1553 secs